COLLEGE  OF  INFORMATION  SCIENCES  AND  TECHNOLOGY 

THE  PENNSYLVANIA  STATE  UNIVERSITY 


Running  Behavioral  Experiments  with  Human  Participants:  A  Practical  Guide 


Frank  E.  Ritter,  Jong  W.  Kim,  and  Jonathan  H.  Morgan 
frank.ritter@psu.edu  jongkim@psu.edu

12  December  2009 
revised  20  January  2010 


Phone  +1  (814)  865-4453  Fax  +1  (814)  865-5604 


College of IST, Information Sciences and Technology Building, University Park, PA 16802


Report  Documentation  Page 


1. REPORT DATE: 20 JAN 2010

2. REPORT TYPE

3. DATES COVERED: 00-00-2010 to 00-00-2010

4.  TITLE  AND  SUBTITLE 

Running  Behavioral  Experiments  with  Human  Participants:  A  Practical 
Guide 

5a. CONTRACT NUMBER

5b. GRANT NUMBER

5c. PROGRAM ELEMENT NUMBER

5d. PROJECT NUMBER

5e. TASK NUMBER

5f. WORK UNIT NUMBER

6. AUTHOR(S)

7.  PERFORMING  ORGANIZATION  NAME(S)  AND  ADDRESS(ES) 

Pennsylvania  State  University, College  of  Information  Sciences  and 
Technology, Information  Sciences  and  Technology  Building, University 
Park, PA, 16802 

8. PERFORMING ORGANIZATION REPORT NUMBER

9. SPONSORING/MONITORING AGENCY NAME(S) AND ADDRESS(ES)

10. SPONSOR/MONITOR’S ACRONYM(S)

11. SPONSOR/MONITOR’S REPORT NUMBER(S)

12.  DISTRIBUTION/AVAILABILITY  STATEMENT 

Approved  for  public  release;  distribution  unlimited 

13.  SUPPLEMENTARY  NOTES 


14.  ABSTRACT 



15.  SUBJECT  TERMS 

16. SECURITY CLASSIFICATION OF:
a. REPORT: unclassified
b. ABSTRACT: unclassified
c. THIS PAGE: unclassified

17. LIMITATION OF ABSTRACT: Same as Report (SAR)

18. NUMBER OF PAGES: 69

19a. NAME OF RESPONSIBLE PERSON



Running  Behavioral  Experiments  with  Human  Participants:  A  Practical  Guide 


Frank  E.  Ritter,  Jong  W.  Kim,  and  Jonathan  H.  Morgan 
frank.ritter@psu.edu  jongkim@psu.edu  jhm5001@psu.edu

College  of  Information  Sciences  and  Technology 
The  Pennsylvania  State  University 
University  Park,  PA  16802 

12  December  2009 

Abstract 

There  are  few  resources  providing  practical  guides  on  how  to  prepare  and  run  experiments  with  human 
participants  in  a  laboratory  setting  at  colleges  and  universities.  In  our  experience,  we  have  found  that 
undergraduate  students  are  taught  how  to  design  experiments,  and  how  to  analyze  experimental  data  in 
courses  such  as  Design  of  Experiments,  Statistics,  etc.  On  the  other  hand,  the  dearth  of  materials 
available  to  students  regarding  either  preparing  or  running  experiments  has  led  to  a  significant  gap 
between  theory  and  practice  in  this  area,  which  is  particularly  acute  outside  of  psychology  departments. 
Consequently,  labs  frequently  must  not  only  impart  these  skills  to  students  but  also  address 
misunderstandings  arising  from  this  divorce  of  theory  and  practice  in  their  formal  education. 

We  present  here  a  short  book  that  can  help  students  to  run  experiments  effectively  and  more  safely  with 
human  participants.  In  this  book,  our  purpose  is  to  provide  hands-on  knowledge  and  actual  procedures  of 
experiments.  We  hope  this  book  will  help  undergraduates  in  psychology,  engineering,  and  the  sciences  to 
run  studies  with  human  participants  in  a  laboratory  setting.  This  will  particularly  help  students  who  are 
not  in  large  departments,  or  are  running  participants  in  departments  that  do  not  have  a  large  or  long 
history  of  experimental  studies  of  human  behavior. 

We  are  generally  speaking  here  from  our  background  running  cognitive  psychology,  cognitive 
ergonomics,  and  human- computer  interaction  studies.  Because  it  is  practical  advice,  we  do  not  cover 
experimental design or data analyses. This practical advice will be less applicable in more distant areas,
but may still be of use. For example, we do not cover how to use complex machinery, such as an fMRI or
ERP. We also do not cover field studies or studies that in the US require a full IRB review. This means
that  we  do  not  cover  how  to  work  with  unusual  populations,  such  as  prisoners,  animals,  and  children,  or 
how  to  take  and  use  measures  that  include  risks  to  the  subjects  or  to  the  experimenter  (e.g.,  saliva,  blood 
samples,  or  private  information). 

We  have  addressed  this  book  toward  advanced  undergraduates  and  early  graduate  students  starting  to  run 
experiments  without  previous  experience;  but  we  believe  this  guide  will  be  useful  to  anyone  who  is 
starting  to  run  research  studies,  training  people  to  run  studies,  or  studying  the  experimental  process. 

When running an experiment, ensuring its repeatability is of greatest importance: it is critical to address
variations in either method or in participant behavior. Running an experiment in exactly the same way
regardless of who is conducting it or where (e.g., different research teams or laboratories) is essential. In
addition, reducing variance in the participants’ behavior is key to an experiment’s repeatability. This
book will help you achieve these requirements, increasing both your comfort and that of the participants
in your experiments.




This  book  consists  of  seven  sections  with  eight  appendices.  We  concisely  describe  below  each  section’s 
contents.  We  hope  you  find  it  relevant  and  useful. 

Section  1,  Overview  of  the  Research  Process,  describes  briefly  where  experiments  fit  into  the  research 
process.  If  you  have  taken  either  an  experimental  methods  course  or  a  research  design  course,  you  can 
skip this chapter. If, on the other hand, you are a new research assistant, or are working on a
project in which you are unclear about your role or how to proceed, this chapter may provide some helpful
context. 

Section  2,  Preparation  for  Running  Experiments,  describes  pertinent  topics  for  preparing  to  run  your 
experiment — such  as  supplemental  reading  materials,  recruitment  of  participants,  choosing  experimental 
measures,  and  getting  Institutional  Review  Board  (IRB)  approval  for  experiments  involving  participants. 

Section  3,  Potential  Ethical  Problems,  describes  ethical  considerations  necessary  for  safely  running 
experiments  with  human  participants — i.e.,  how  to  ethically  recruit  participants,  how  to  handle  data 
gathered  from  participants,  how  to  use  that  data,  and  how  to  report  that  data.  Being  vigilant  and  aware  of 
these  topics  is  a  key  component  to  rigorous,  as  well  as  ethical,  research. 

Section 4, Risks to Validity to Avoid While Running an Experiment, describes risks that can
invalidate your experimental data. If you fail to avoid these risks, you may obtain either false or uninterpretable
results from your experiment. Thus, before starting your study, you should be aware of these risks and
how to avoid them.

Section  5,  Running  a  Research  Study,  describes  practical  information  about  what  you  have  to  do  when 
you  run  the  experiments.  This  section  will  give  an  example  procedure  that  you  can  follow. 

Section  6,  Concluding  a  Research  Session  and  Study,  describes  practical  information  about  what  to  do 
at  the  conclusion  of  each  experimental  session. 

Section 7, Example Research Studies, describes example experimental studies, each with a brief
synopsis of its procedural steps, so that you can see how a real experiment proceeds. We also include
several examples, forms, and checklists as appendices. These forms will vary by lab and IRB, but
they provide examples of the style and tone.


Acknowledgements 

Preparation of this manuscript was partially sponsored by a grant from the Division of Human
Performance, Training, and Education at the Office of Naval Research, under Contract #
W911QY-07-01-0004. The views and conclusions contained in this report are those of the
authors  and  should  not  be  interpreted  as  representing  the  official  policies,  either  expressed  or 
implied,  of  the  U.S.  Government  or  the  Pennsylvania  State  University. 

Ellen  Bass,  Richard  Carlson,  Karen  Feigh,  Alex  Kirlik,  and  Razvan  Orendovici  have  provided 
useful  comments,  but  incompleteness  and  inadequacies  remain  the  fault  of  the  authors. 


Table  of  Contents 


1 OVERVIEW OF THE RESEARCH PROCESS
1.1 Overview
1.2 Definition of Terms
1.3 Further Readings

2 PREPARATION FOR RUNNING EXPERIMENTS
2.1 Literature in the Area
2.2 Choice of a Term: Participants or Subjects
2.3 Recruiting Participants
2.4 Subject Pools
2.5 Care, Control, Use, and Maintenance of Apparatus
2.6 Testing Facility
2.7 Choice of Measures: Performance, Time, Actions, Errors, Verbal Protocol Analysis, and Other Measures
2.7.1 Types of measures
2.7.2 Levels of measurement
2.7.3 Scales of measurement
2.8 Error Data
2.9 Run Analysis with Pilot Data
2.10 Institutional Review Board (IRB)
2.11 Further Readings

3 POTENTIAL ETHICAL PROBLEMS
3.1 Recruitment of a Broad Selection of Subjects
3.2 Talking with Subjects
3.3 Coercion of Participants
3.4 Sensitive Data
3.5 Plagiarism
3.6 Fraud
3.7 Summary
3.8 Further Readings

4 RISKS TO VALIDITY TO AVOID WHILE RUNNING AN EXPERIMENT
4.1 Validity Defined: Surface, Internal, and External
4.2 Risks to Validity
4.2.1 Power: How many participants?
4.2.2 Experimenter effects
4.2.3 Participant effects
4.2.4 Randomization
4.3 Example Problems
4.4 Further Readings

5 RUNNING A RESEARCH STUDY
5.1 Script
5.2 Piloting
5.3 Dress Code for Experimenters
5.4 Welcome
5.5 Missing Subjects
5.6 Decorum
5.7 Debriefing
5.8 Payments and Wrap-up
5.9 Simulator Studies
5.10 Problems and How to Deal with Them
5.11 Example Problems

6 CONCLUDING A RESEARCH SESSION AND STUDY
6.1 Data Care, Security, and Privacy
6.2 Data Backup
6.3 Chance for Insights

7 EXAMPLE RESEARCH STUDIES
7.1 Skill Retention Study

8 AFTERWORD

APPENDIX A: GLOSSARY
APPENDIX B: A CHECKLIST FOR SETTING-UP EXPERIMENTS
APPENDIX C: A OVERVIEW OF STEPS FOR RUNNING EXPERIMENTS
APPENDIX D: EXAMPLE SCRIPT TO RUN AN EXPERIMENT
APPENDIX E: SAFETY OF EXPERIMENTS
APPENDIX F: EXAMPLE CONSENT FORM
APPENDIX G: EXAMPLE DEBRIEF FORM
APPENDIX H: EXAMPLE IRB APPLICATION

9 REFERENCES


1  Overview  of  the  Research  Process 


This  chapter  describes  briefly  where  experiments  fit  into  the  research  process.  If  you  have  had  an 
experimental  methods  course  or  a  research  design  course,  you  can  skip  this  chapter.  If  you  are  a 
new  research  assistant,  or  are  working  on  a  project  in  which  you  are  unclear  about  how  you  fit  in, 
this  chapter  may  provide  some  helpful  context.  We  also  define  some  common  terms  so  that  you 
can  better  communicate  with  the  principal  investigator  and  other  members  of  your  research  team. 

1.1  Overview 

Here  is  a  list  of  important  topics  that  form  our  overview  of  the  research  process. 

(1)  Establish  a  hypothesis 

The most common hypothesis tested is that one factor in the world influences another factor
that can be measured. To test this hypothesis, the first factor, such as the time exposed to a
stimulus, has to be varied, and the second factor, such as how long something is remembered,
must be measured.

Sometimes,  a  hypothesis  can  simply  be  that  the  area  is  interesting,  and  that  gathering 
data  will  provide  insights.  This  is  occasionally  referred  to  as  a  fishing  expedition, 
because  you  do  not  know  what  you  will  catch.  This  type  of  hypothesis  is  sometimes 
criticized  for  being  too  general,  but  for  exploratory  work,  it  can  be  very  successful. 

For  example,  we  suspected  that  subjects  when  confronted  with  a  problem  would 
choose  problem-solving  strategies  based  upon  certain  key  features,  but  we  did  not 
know  which  features  factored  into  the  participants’  choices.  So,  we  included  multiple 
types  of  problems  and  multiple  types  of  features.  Then,  with  analysis,  we  were  able  to 
pull  out  which  features  participants  based  their  decisions  upon  across  a  wide  range  of 
problem  types  (Reder  &  Ritter,  1992). 

(2)  Set  up  the  experiment 

To investigate what you want to know, that is, to test your established hypothesis, you
need a clear picture of what is required to achieve your goal.

For example, suppose that you want to know how well students retain foreign
vocabulary words depending on the spacing of learning (i.e., massed or distributed
practice). To set up this experiment, an experimenter would need materials for
displaying the vocabulary words and for recording responses from participants (e.g.,
time or accuracy).

(3)  Pilot  study  to  understand  the  theory’s  edges 

A pilot study helps you identify what is working, what is not working, and
what is missing for a successful investigation. A theory typically explains only
part of the phenomena. The theory’s edge marks the region within which the
phenomena are scientifically explained and understood. Understanding the theory’s
edge is important because it lets you identify a concrete direction for your research.

For example, you might ask friends and colleagues to try a mirror tracing task. You
might run people casually, in their offices, and record their times to learn how their
response times differ by stimulus. You would not report these results, but use them to
adjust your apparatus and your stimuli.

(4)  Recruit  subjects 

Sometimes recruiting subjects is easy, and sometimes it is hard. It
depends on your local circumstances. If you have access to a subject pool, recruiting is easier.
If, on the other hand, your study requires particular subjects with particular expertise
(such as airplane pilots), recruiting is harder.

This  step  may  be  done  while  piloting  and  setting  up  the  study  if  there  are  few  risks  to 
preparing  the  study  or  if  subjects  are  hard  to  recruit.  If  subjects  are  easy  to  recruit  and 
the  study  harder  to  prepare,  it  may  be  done  in  the  opposite  order. 

(5) Run subjects

Running subjects may give you different outcomes than those of your pilot study. These
differences are generally due to individual variability: participants may think or
react in unanticipated ways. Or, you may get different
results because your study is more formal. In either case, or even when there
are fewer surprises, you are interested in seeing the truth about the world.

(6)  Adjust  the  experiment 

It may be necessary to adjust the experiment to address problems identified in
previous stages, or to run it as a second study. This process might be repeated; however,
through this process of adjustment, the experiment will eventually stabilize.

(7) Run a modified study

After iterating through a series of adjusted pilot studies, you should reach an
experimental design that gives you much more stable and interpretable outcomes. This
process often takes more time than you expect, but it is
necessary to produce interpretable and repeatable results.




(8)  Analyze  results 

If  you  have  been  careful,  you  have  analyzed  your  pilot  data  to  make  sure  that  the 
output  from  the  study  can  be  analyzed.  Sometimes,  timestamps  are  in  the  wrong 
format  to  be  read  by  an  analysis  program,  or  are  not  recorded,  or  the  subject’s  name 
has  remained  attached  to  a  file  (rather  than  their  ID).  Checking  that  the  data  can  be 
put  into  the  analysis  software  is  time  well  spent  before  you  run  participants  in  a  larger 
trial. 

(9)  Write  a  final  report 

Take  the  results  and  prepare  a  manuscript:  perhaps  in  the  form  of  a  technical  report 
for  a  sponsor,  as  a  conference  paper,  journal  article,  or  maybe  in  a  thesis.  There  are 
useful  books  on  this  area,  and  we  do  not  address  this  topic  further  here. 

Note  that  these  steps  are  normative;  they  are  what  should  happen.  In  practice,  this  process  often 
runs  in  parallel,  can  vary  in  order  (insights  do  not  always  come  between  experiments),  and  is 
iterative  (Boehm  &  Hansen,  2001).  Furthermore,  breakthroughs  frequently  result  from 
interactions  between  multiple  experiments  and  researchers  in  a  lab. 
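
Step (8) above recommends checking that pilot output can be analyzed before running participants in a larger trial. A minimal sketch of such a check in Python follows. The log layout (a CSV with participant_id, timestamp, and response columns), the ISO 8601 timestamp format, and the "P001"-style ID pattern are all assumptions made for illustration; substitute whatever your apparatus actually records.

```python
import csv
import io
import re
from datetime import datetime

# Hypothetical ID convention: "P" followed by three digits (e.g., P007).
ID_PATTERN = re.compile(r"^P\d{3}$")


def check_pilot_log(csv_text):
    """Return a list of human-readable problems found in a pilot log.

    Assumes (for illustration only) a CSV with the columns
    participant_id, timestamp, and response, where timestamps are
    written in ISO 8601 format (e.g., 2010-01-20T14:03:27.250).
    """
    problems = []
    reader = csv.DictReader(io.StringIO(csv_text))
    for row_number, row in enumerate(reader, start=2):  # row 1 is the header
        pid = (row.get("participant_id") or "").strip()
        stamp = (row.get("timestamp") or "").strip()

        # A name left in the file instead of an ID is a privacy problem.
        if not ID_PATTERN.match(pid):
            problems.append(f"row {row_number}: '{pid}' is not an anonymous ID")

        # Missing or unparseable timestamps cannot be analyzed later.
        if not stamp:
            problems.append(f"row {row_number}: timestamp not recorded")
        else:
            try:
                datetime.fromisoformat(stamp)
            except ValueError:
                problems.append(f"row {row_number}: cannot parse timestamp '{stamp}'")
    return problems


if __name__ == "__main__":
    sample = (
        "participant_id,timestamp,response\n"
        "P001,2010-01-20T14:03:27.250,correct\n"
        "Jane Doe,2010-01-20T14:05:02.100,correct\n"
        "P002,,error\n"
        "P003,01/20/10 2:07pm,correct\n"
    )
    for problem in check_pilot_log(sample):
        print(problem)
```

Running a check like this on every pilot file, before the main study starts, catches the three problems named in step (8) while they are still cheap to fix.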

1.2  Definition  of  Terms 

The following terms, with their definitions, are frequently used in the course of a research
study.

(1)  Null  hypothesis  vs.  alternative  hypothesis 

The null hypothesis states that there is nothing (null) going on in the experimental
investigation: factor 1 has no influence on factor 2, and any correlation you see is just
noise, a random occurrence. If significance testing does not let you reject the null
hypothesis, you conclude that the data show no evidence of an influence.

The alternative hypothesis competes with the null hypothesis; it is what researchers
want to show to be true through the research process.

Here are some examples. One of the major findings by Ebbinghaus (1885/1913) is the
principle of distributed practice. This principle holds that it is desirable to spread out
practice rather than massing it together in a session. That is, 10 minutes of practice per
day for 5 days can provide better learning and retention than 50 minutes of
practice in one day. Based on this principle, let us construct a research hypothesis. In
this case, the null hypothesis would be that there is no statistically significant
difference in the learning and retention rates between distributed and massed practice
groups. The alternative hypothesis would be that a statistically significant difference does exist
between the two groups, and that the distributed practice group displays higher
learning and retention rates than the massed practice group.

There may be other alternative hypotheses as well. If the study is not well designed or well
run, something besides the hypothesized factor may be causing the changes observed.
For example, if the first 20 subjects to show up are put into one group, and the last 20
to show up are put into another group, then the differences between groups might not
be caused by how they are treated, but by whatever caused them to be in the first or
last group, such as conscientiousness.

(2)  Significance  testing 

Significance testing refers to the process used to determine whether an effect (or
effects) observed in the experiment is a real effect, rather than just the result of
chance variation in the data. In a significance test, you either reject or fail to reject the null
hypothesis; rejecting it lends support to the alternative hypothesis.

(3)  Independent  variable  vs.  dependent  variable 

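An independent variable is a factor that the experimenter manipulates; a dependent
variable is what is measured. For more information, please refer to Section 2.7.1.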

(4)  Stimuli 

Stimuli  are  events  that  evoke  or  cause  a  reaction.  That  is,  in  this  context,  stimuli  are 
events  evoking  reactions  from  a  subject. 

(5)  Reaction  time  (RT) 

Reaction time is the elapsed time between the presentation of a stimulus and the
subject’s consequent response to that stimulus.

(6)  Principal  Investigator 

A  principal  investigator  is  an  official  point  of  contact  and  is  responsible  for  a  grant 
contract.  A  principal  investigator  can  also  take  the  role  of  the  lead  researcher, 
responsible  for  conducting  experiments  involving  human  participants  and  gathering 
data. 

(7)  Lead  researcher 

A  lead  researcher  generally  indicates  a  person  who  is  responsible  for  designing  and 
running  experiments.  An  experimenter  actually  administers  experimental  procedures 
to  participants  and  collects  data. 

(8)  IRB 

IRB  stands  for  an  Institutional  Review  Board,  which  is  a  review  committee  to  help 
protect  the  rights  and  welfare  of  human  participants  in  a  research  study.  An  IRB  can 
(a) approve/disapprove a research study, (b) modify a research study, (c) conduct
continuing  reviews,  (d)  observe/verify  changes,  (e)  suspend  or  terminate  approval,  and 
(f)  observe  the  consent  process  and  the  research  procedures. 

Studies in US institutions need IRB approval according to Federal law. Good review
boards can do this quickly. Great review boards can also help you with your research,
pointing out how to be more responsible to subjects and how to improve your study
(gathering data that is interpretable is also a responsibility). They
will also require your cooperation, and you should look forward to working with them.




(9)  Informed  consent 


Informed  consent  is  a  process  to  (a)  provide  specific  information  about  the  research 
study  and  its  procedures  to  the  participants  (or  subjects),  (b)  answer  questions  to 
ensure  the  participants  understand  the  research,  (c)  provide  the  participants  with 
adequate  time  to  consider  their  decisions,  and  (d)  obtain  the  voluntary  agreement  from 
the  participants  to  take  part  in  the  research  study. 
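
Two of the terms above lend themselves to a short illustration: the rival explanation that arises when groups are formed by arrival order (term 1), and significance testing (term 2). The Python sketch below first assigns participants to groups at random and then runs a permutation test on the difference between group means. The permutation test is one simple significance test chosen here for illustration, not a method this guide prescribes; the function names, participant IDs, and retention scores are all hypothetical.

```python
import random
from statistics import mean


def assign_randomly(participant_ids, group_names, seed=None):
    """Shuffle participants, then deal them out to groups in turn.

    Arrival order therefore cannot determine group membership.
    """
    rng = random.Random(seed)
    shuffled = list(participant_ids)
    rng.shuffle(shuffled)
    groups = {name: [] for name in group_names}
    for index, pid in enumerate(shuffled):
        groups[group_names[index % len(group_names)]].append(pid)
    return groups


def permutation_test(scores_a, scores_b, n_permutations=2000, seed=None):
    """Estimate a two-sided p-value for the difference in group means.

    Under the null hypothesis the group labels are arbitrary, so we
    repeatedly reshuffle the labels and count how often the reshuffled
    difference is at least as large as the observed one.
    """
    rng = random.Random(seed)
    observed = abs(mean(scores_a) - mean(scores_b))
    pooled = list(scores_a) + list(scores_b)
    n_a = len(scores_a)
    at_least_as_extreme = 0
    for _ in range(n_permutations):
        rng.shuffle(pooled)
        diff = abs(mean(pooled[:n_a]) - mean(pooled[n_a:]))
        if diff >= observed:
            at_least_as_extreme += 1
    # Adding 1 to both counts keeps the estimate from ever being exactly 0.
    return (at_least_as_extreme + 1) / (n_permutations + 1)


if __name__ == "__main__":
    ids = [f"P{n:03d}" for n in range(1, 41)]
    groups = assign_randomly(ids, ["distributed", "massed"], seed=1)
    print({name: len(members) for name, members in groups.items()})

    # Hypothetical retention scores (words recalled), for illustration only.
    distributed = [14, 15, 13, 16, 15, 14, 17, 15]
    massed = [11, 12, 10, 13, 12, 11, 12, 10]
    print("estimated p =", permutation_test(distributed, massed, seed=1))
```

A small estimated p-value means that a difference this large would rarely arise if the group labels were arbitrary, which is grounds for rejecting the null hypothesis. Fixing the seed makes the assignment reproducible, so it can be recorded alongside the data.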

1.3  Further  Readings 

A course in experimental methods is probably the best way to learn how to design and run
studies. In addition, we list below suggested reading materials that provide
further knowledge about experimental design and methods. They are listed in alphabetical order
by first author.


•  Bernard,  H.  R.  (2000).  Social  research  methods:  Qualitative  and  quantitative  approaches. 
Thousand  Oaks,  CA:  Sage. 

This  is  a  relatively  large  book.  It  covers  a  wide  range  of  methods,  some  in  more  depth  than 
others.  It  includes  some  instructions  for  how  to  perform  the  methods. 


•  Coolican,  H.  (2006).  Introduction  to  research  methods  in  psychology  (3rd  ed.).  London, 
UK:  Hodder  Arnold. 

•  Cozby,  P.  C.  (2004).  Methods  in  behavioral  research  (8th  ed.).  New  York,  NY:  McGraw- 
Hill. 

•  Leary,  M.  R.  (2004).  Introduction  to  behavioral  research  methods  (4th  ed.).  Boston,  MA: 
Pearson. 

•  Martin,  D.  W.  (1995).  Doing  psychology  experiments  (4th  ed.).  Pacific  Grove,  CA: 
Brooks/Cole  Publishing. 

•  Ray, W. J. (2003). Methods: Toward a science of behavior and experience (7th ed.).
Belmont, CA: Wadsworth/Thomson Learning.

This  is  a  book  for  the  first  course  in  experimental  methods  in  psychology.  It  is  a  useful  and 
gentle  introduction  to  how  to  create  and  run  studies  and  how  to  present  the  results. 




2  Preparation  for  Running  Experiments 

Broadly speaking, scientists and engineers pursue scientific inquiries, and experiments
constitute one powerful form of investigation. For many studies, human participation is
necessary to adequately explore the question. One example is evaluating the usability of a haptic
interface (e.g., a Wii remote) before its introduction to the market. The
question, then, is what considerations should inform the investigator when conducting these kinds
of research studies.

In  general,  scientific  inquiries  in  the  areas  of  human-computer  interaction  (HCI),  human  factors, 
cognitive  psychology,  and  cognitive  science  require  the  involvement  of  human  participants.  One 
distinguishing  factor  of  these  disciplines,  and  thus  experiments  in  these  areas,  has  been  the 
centrality  of  the  human  participant. 

Consequently, working in these areas requires understanding not only the theoretical and ethical
issues inherent in running human participants but also the practical aspects of the process itself.
To start to frame this discussion, we provide an overview of this process and of the issues related to it.

Let  us  again  consider  a  usability  study  evaluating  a  haptic  interface.  For  this  investigation,  a  lead 
research  scientist  or  a  lead  researcher  would  establish  a  study  hypothesis  and  design  an 
experiment  by  first  defining:  what  to  measure  (dependent  variables),  what  factors  to  manipulate 
(independent  variables),  and  what  environmental  conditions  to  consider. 
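
The design decisions described above (what to measure, what to manipulate, and which environmental conditions to consider) can be written down explicitly before any participant is run. The following Python sketch records them in a small data structure and enumerates the resulting conditions. The class name, its fields, and the specific variables for the haptic interface study are hypothetical placeholders, not details taken from an actual study.

```python
from dataclasses import dataclass, field
from itertools import product


@dataclass
class ExperimentDesign:
    """A written-down record of the lead researcher's design decisions."""
    hypothesis: str
    # Factors to manipulate: variable name -> list of levels.
    independent_variables: dict
    # What to measure on every trial.
    dependent_variables: list
    # Environmental conditions to hold constant across sessions.
    environment: dict = field(default_factory=dict)

    def conditions(self):
        """Return every combination of independent-variable levels."""
        names = list(self.independent_variables)
        level_lists = [self.independent_variables[name] for name in names]
        return [dict(zip(names, combo)) for combo in product(*level_lists)]


# Hypothetical values for a usability study of a haptic interface.
design = ExperimentDesign(
    hypothesis="Haptic feedback reduces target-selection time.",
    independent_variables={
        "feedback": ["vibration on", "vibration off"],
        "target_size": ["small", "medium", "large"],
    },
    dependent_variables=["selection time (ms)", "errors"],
    environment={"lighting": "standard office", "seating": "fixed chair"},
)

if __name__ == "__main__":
    for condition in design.conditions():
        print(condition)
```

Listing the conditions this way makes it easy to see how many cells the design has, which in turn informs how many participants and how much apparatus time the study will need.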

Now picture a lab in which, perhaps, multiple experiments are going on at the same time. Joining
the lab as a new research assistant, you have come to help out and to learn in this area,
specifically with running research studies. What do you do? Where do you start? How do you
avoid common and easily fixed problems?

2.1  Literature  in  the  Area 

This book does not assume that you have a background in statistics or have studied experimental
design; to help run a study you often do not need to know these areas (but they do help!). If
you  need  help  in  these  areas,  there  are  other  materials  that  will  prepare  you  to  design  experiments 
and  analyze  experimental  data.  In  addition,  most  graduate  programs  with  concentrations  in  HCI, 
cognitive  science,  or  human  factors  engineering  feature  coursework  that  will  help  you  become 
proficient  in  these  topics. 

Many introductory courses in statistics, however, focus primarily on introducing the basics of
ANOVA and regression. These tools are unsuitable for many studies analyzing human subject
data where the data are qualitative or sequential. Care, therefore, must be taken to design an
experiment that collects the proper kinds of data. If ANOVA and regression are the only tools at
your disposal, we recommend that you find a course focusing on the design of experiments
featuring human participants, as well as the analysis of human data. We also recommend that,
where possible, you gather data that can be used in a regression, because such data can be used
to make stronger predictions.

Returning to the topic of readings, it is generally useful to have read in the area in which you are
running experiments. This reading will provide you further context for your work, including
discussions about methods, types of subjects, and pitfalls you may encounter. For example, the
authors of one of our favorite studies, an analysis of animal movements, note that data collection
had to be suspended after they were chased by elephants! If there are elephants in your
domain, it is useful to know about them. There are, of course, less dramatic problems such as
common mistakes subjects make, correlations in stimuli, self-selection biases in a subject
population, power outages, printing problems, or fewer participants than expected. There
are reasons to be blind to the hypothesis being tested by the experiment (that is, you do not know
what treatment or group the subject you are interacting with is in, so that you do not
implicitly or inadvertently coach the subjects to perform in the expected way). Even so, if there are
elephants, good experimenters know about them, and prepared research assistants particularly
want to know about them!

As  a  result,  the  reading  list  for  any  particular  experiment  is  very  individualized.  You  should  talk 
to other experimenters, as well as the lead researcher, about what you should read.

2.2  Choice  of  a  Term:  Participants  or  Subjects 

Disciplines vary as to which term they prefer: subject or participant. Participant is the newer
term, and was adopted by many research communities to emphasize the researcher’s ethical
obligations to those participating in their experiments. Nevertheless, subject is still commonly
used, and appears in older research. In many psychology programs, the term
participants is preferred to subjects. The Publication Manual of the American
Psychological Association (APA), 5th ed. (American Psychological Association, 2001, p. 70)
suggests replacing the impersonal term subjects with the more descriptive term participants.
The APA goes on to define participants as individuals: college students, children, or
respondents.

Whether following the APA guideline or not, we should recognize that S, Ss, S's, E, Es, and E’s
indicate Subject, Subjects, Subject's, Experimenter, Experimenters, and Experimenter's in earlier
research; Fitts’s 1954 study is one example. Furthermore, even within the discipline of
psychology, opinion can be split. Roediger (2004) argues against the change to participants
made in the latest version of the APA's Publication Manual. He argues that subjects is both more
consistent and clearer, noting that the term has been in use since the 1800s and that it better
defines the relationships involved. He argues that the term participants fails to adequately
capture the distinction between the experimenter and those in the study; strictly speaking,
experimenters are participants as well. We use these terms interchangeably in this document
because we recognize other research communities may still prefer subjects, and because not all
psychologists are members of the APA.

2.3  Recruiting  Participants 

Recruiting participants for your experiment can be a time-consuming and potentially difficult
task, but it is essential for producing meaningful data. An experimenter, thus,
should plan participant recruitment carefully with the lead researcher (or the principal
investigator). Ask yourself, “What are the important
characteristics that my participants need to have?” Your choices will be under scrutiny, so having
a coherent reason for which participants are allowed or disallowed into your study is important.

First, it is necessary to decide on a population of interest from which you will recruit participants.
For example, if an experimenter wants to measure the learning effect of foreign language
vocabulary, it is necessary to exclude participants who have prior knowledge of that language. In
addition, it may be necessary to consider age, educational background, gender, and similar
characteristics to correctly choose the target population.

Second, it is necessary to decide how many participants to recruit. The number of
participants can affect your final results. All else being equal, the more participants you recruit,
the more reliable your results will be. However, limited resources (e.g., time and money) force an
experimenter to find an appropriate and reasonable number of participants. You may need to refer to
previous studies to get some idea of the number of participants, or you may need to run a power
analysis to determine the sample size for the research study, if possible (most modern statistics
books discuss this and teach you how to do it, e.g., Howell, 2008).
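
As a rough illustration of such a sample size calculation, the sketch below uses the common normal
approximation for comparing the means of two independent groups. The function name, the default
significance level of .05, and the default power of .80 are our assumptions for illustration, not
values prescribed by this guide; a statistics text or your lead researcher should guide the real
choice.

```python
import math
from statistics import NormalDist


def participants_per_group(effect_size, alpha=0.05, power=0.80):
    """Approximate participants needed per group for a two-sided comparison
    of two independent group means.

    Uses the normal approximation n = 2 * ((z_(1-alpha/2) + z_power) / d)^2,
    where d is the standardized effect size (Cohen's d). The result is
    slightly smaller than what an exact t-based calculation would give.
    """
    if effect_size <= 0:
        raise ValueError("effect_size must be positive")
    standard_normal = NormalDist()
    z_alpha = standard_normal.inv_cdf(1 - alpha / 2)  # critical value for the test
    z_power = standard_normal.inv_cdf(power)          # quantile for the desired power
    return math.ceil(2 * ((z_alpha + z_power) / effect_size) ** 2)


# Hypothetical example: a medium effect (d = 0.5) versus a large effect (d = 0.8).
print(participants_per_group(0.5))  # 63
print(participants_per_group(0.8))  # 25
```

The example shows why small expected effects are expensive to study: halving the effect size
roughly quadruples the number of participants required.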

There are several ways that participants can be recruited. The simplest way is to use the
experimenters themselves. In simple vision studies, this is often done because the performance
differences between people in these types of tasks are frequently negligible and knowing the
hypothesis to be tested does not influence performance. Thus, the results remain generalizable
even with a small number of participants.

The next way of recruiting subjects that we will consider is a sample of convenience.
Samples of convenience consist of people who are accessible to the researcher. Many studies use
this approach, so much so that it is often not mentioned. Generally for these studies, only the
sample size and some salient characteristics that might influence the
participants’ performance on the task are noted. These factors might include age, major, sex, education
level, and factors related to the study, such as nicotine use in a smoking study, or number of math
courses in a tutoring study.

In studies using samples of convenience, try distributing an invitation email to a group mailing
list (e.g., students in the psychology department or an engineering department). Also, you can
post recruitment flyers on a student bulletin board, or place an advertisement in a student
newspaper. Make efficient use of all the resources and channels that are available to you.

There are disadvantages to using a sample of convenience. Perhaps the largest is that the
resulting sample is less likely to lead to generalizable results. The subjects you recruit are less
likely to represent a sample from a larger population. Students who are subjects are different
from students who are not subjects. To name just one feature, they are more likely to take a
psychology class and end up in a subject pool. In addition, the sample itself might have hidden
variability in it. The subjects you recruit with one method (an email to them) may differ from those
recruited with another method (a poster). We also know that subjects differ over time: those who
come early to fulfill a course requirement are more conscientious than those who come late. So, be
sure to randomly assign these types of subjects to the conditions in your study.
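
Random assignment can be as simple as shuffling the list of participants and dealing them out to
the conditions in turn. The following is a minimal sketch under our own assumptions: the function
name, the participant IDs, and the condition labels are hypothetical, and your lab may already have
its own assignment procedure.

```python
import random
from collections import Counter


def assign_conditions(participant_ids, conditions, seed=None):
    """Randomly assign participants to conditions with near-equal group sizes."""
    rng = random.Random(seed)  # a fixed seed makes the assignment reproducible
    shuffled = list(participant_ids)
    rng.shuffle(shuffled)
    # Deal the shuffled participants out to the conditions in rotation, so that
    # group sizes differ by at most one.
    return {pid: conditions[i % len(conditions)] for i, pid in enumerate(shuffled)}


# Hypothetical example: 20 participants and two conditions.
assignment = assign_conditions(range(1, 21), ["control", "treatment"], seed=2010)
print(Counter(assignment.values()))
```

Because the order is shuffled before the conditions are dealt out, early and late sign-ups are
spread across the conditions by chance rather than by recruitment method or arrival time.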


The most carefully organized, and often the largest, type of sample is a random sample. In this case,
researchers randomly sample a given population by carefully applying sampling methodologies
meant to ensure statistical validity and equal likelihood of selecting each potential subject.

Asking students questions as they go into a football game does not constitute a random sample
because some students do not go (selection bias). Other methods, such as selecting every 10th student
based on a telephone number or ID, introduce their own biases. For example, some students do
not have a publicly available phone number, and some subpopulations register early for their
ID numbers. Truly choosing a random sample is difficult, and you should discuss how best to do
this with your lead researcher.
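
If a complete roster of the population is available, the drawing itself is straightforward. The
sketch below assumes such a roster exists (the hypothetical student IDs stand in for it) and gives
every person on it the same chance of selection. It does not address the harder problem described
above: a roster that omits part of the population will still produce a biased sample.

```python
import random


def draw_random_sample(roster, sample_size, seed=None):
    """Draw a simple random sample, without replacement, from a complete roster."""
    rng = random.Random(seed)  # a fixed seed lets the draw be documented and repeated
    return rng.sample(list(roster), sample_size)


# Hypothetical roster of 500 student IDs.
roster = [f"student-{n:03d}" for n in range(1, 501)]
invited = draw_random_sample(roster, 30, seed=42)
print(len(invited), len(set(invited)))  # 30 30
```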

2.4  Subject  Pools 

One  approach  for  recruiting  participants  is  a  subject  pool.  Subject  pools  are  generally  groups  of 
undergraduates  who  are  interested  in  learning  about  psychology  through  participation.  Most 
Psychology  departments  organize  and  sponsor  subject  pools.  For  more  information,  refer  to  the 
next  section. 

Subject  pools  offer  a  potential  source  of  participants.  You  should  discuss  this  as  an  option  with 
your  lead  researcher,  and  where  appropriate,  learn  how  to  fill  out  the  requisite  forms.  If  the 
students  in  the  study  are  participating  for  credit,  you  need  to  be  particularly  careful  with  recording 
who  participated  because  the  students’  participation  and  the  proof  of  that  participation  represent 
part  of  their  grade. 

A  whole  book  could  be  written  about  subject  pools.  Subject  pools  are  arrangements  that 
psychology  or  other  departments  provide  to  assist  researchers  and  students.  The  department  sets 
up  a  way  for  experimenters  to  recruit  subjects  for  studies.  Students  taking  particular  classes  are 
either  provided  credit  towards  the  class  requirement  or  extra  credit. 

The  theory  is  that  participating  in  a  study  provides  additional  knowledge  about  how  studies  are 
run,  and  provides  the  participant  with  additional  knowledge  about  a  particular  study.  The 
researchers,  in  turn,  receive  access  to  a  pool  of  potential  subjects. 

When  students  do  not  wish  to  participate  in  a  study,  alternative  approaches  for  obtaining  course 
credit  are  provided. 

2.5  Care,  Control,  Use,  and  Maintenance  of  Apparatus 

What materials do you need to run experiments? Experiments in a controlled environment
(e.g., a laboratory) usually require participants to interact with a computer device, a prototype, or
a mock-up. For example, it is possible to implement a task environment on a computer screen,
such  as  an  air  traffic  control  task  like  Argus  (Schoelles  &  Gray,  2001),  a  driving  simulator  like 
Distract-R  (Salvucci,  in  press),  experimental  tasks  with  E-Prime  (e.g.,  MacWhinney,  St.  James, 
Schunn,  Li,  &  Schneider,  2001),  or  a  spreadsheet  task  environment  (Kim,  Koubek,  &  Ritter, 
2007).  Part  of  what  you  will  have  to  do  is  to  understand  the  task  environment  so  that  you  can 
prepare  it  for  each  session,  save  the  data  if  it  collects  data,  and  shut  it  down  after  each  session. 


As  you  begin  to  work  on  your  research  task,  you  are  likely  to  consider  several  approaches  for 
improving  your  study.  Finding,  developing,  or  modifying  the  task  environment  to  support  your 
study  is  often  an  early  consideration.  The  task  environment  provides  the  setting  for  investigating 
the  questions  of  interest,  and  having  the  right  task  environment  is  a  key  element  to  a  successful 
study.  If  designing  and  implementing  a  new  task  environment  for  your  research  study  seems 
infeasible,  try  reusable  and  sharable  environments. 

After choosing and setting up the task environment, the next step is to determine what method
you will use to record user performance. Data collection deserves serious thought if it is
to provide meaningful results. Data can be qualitative (i.e., not in a numerical form) or
quantitative (i.e., in a numerical form). Different hypotheses and theories require different types
of data to test them, and thus different methods to collect data. For example, you can use a
camcorder in an interview to gather qualitative information or a keystroke logger like RUI (Kukreja,
Stevenson, & Ritter, 2006) to gather quantitative data in unobtrusive and
automatic ways. We suggest avoiding manually recording data: it is hard, takes a significant
amount of time, and is prone to error. Sometimes manual data collection is unavoidable, but
often, with a little forethought, ways can be found to automate the process.

An apparatus is often required to gather behavioral data. In cognitive science, recording user
behavior by using experimental software, a video recorder, a voice recorder, or a
keystroke/mouse logger are all common practices. There are also tools for generating studies
such as E-Prime. Also, some studies require using an eye-tracker to gather eye-movement data.

Experimental  software 

Many studies are performed with custom-built, or bespoke, software. The research team
conducting the study usually develops these custom applications, and they can vary from a simple
program to present stimuli and record reaction times to more complex programs (interactive
simulations, for instance). As a new research assistant, you will be instructed on how to start up and
run the software necessary for your work. As you run subjects with such
programs, however, try moving from being a passive user to an active one. Make any suggestions
that you think might improve the program’s usability as they arise, note mistakes in the program,
and observe how subjects interact with the program in novel or interesting ways. These insights can
lead to further studies and to further hypotheses to test.

E-Prime 

E-Prime1  was  the  first  commercial  tool  designed  to  generate  psychological  experiments  on  a 
personal  computer  (MacWhinney,  St.  James,  Schunn,  Li,  &  Schneider,  2001).  E-Prime  is 
compatible  with  Microsoft  Windows®  XP/Vista.  PsyScope2  is  another  experiment  generation 
program,  and  a  predecessor  of  E-Prime.  You  can  download  it  free  under  a  GNU  General  Public 
License3.  PsyScope  runs  on  the  Macintosh.  You  may  be  asked  to  use  these  tools  in  your  current 
study or may find them to be of great value in producing study stimuli more quickly.


1  http://www.pstnet.com/products/e-prime 

2  http://psy.ck.sissa.it 

3  http://www.gnu.org/copyleft/gpl.html 


Keystroke  loggers 

It  is  often  useful  to  record  the  user’s  behavior  while  they  perform  the  task,  not  just  the  total  task 
time.  This  can  be  done  in  several  ways.  Some  researchers  have  used  video  recordings.  This 
provides  a  very  stable  result  that  can  include  multiple  details.  It  also  can  provide  a  rich  context, 
particularly  if  both  the  subject  and  his  or  her  surroundings  are  recorded.  On  the  other  hand, 
analyzing  video  recordings  is  time  consuming  and  can  be  error  prone.  Analyzing  video  data 
requires  examining  the  video  frame-by-frame  to  find  when  the  user  performs  each  action,  and 
then  recording  each  action  by  hand  into  your  dataset. 

Another approach is to record just the keystrokes or mouse clicks. There are commercial tools
available from companies like Noldus that will record keystrokes. We have also designed a
keystroke logger, RUI (Recording User Input). RUI is a keystroke and mouse action logger for
the Windows and Mac OS X platforms (Kukreja, Stevenson, & Ritter, 2006). It is a very useful
tool for recording user behavior in human-computer interaction studies. RUI can be used to
measure response times of participants interacting with a computer interface over time.
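
To give a sense of what can be done with a timestamped log of user actions, the sketch below
computes the time between consecutive events and the total task time. The log format used here, a
list of (timestamp in seconds, description) pairs, is a hypothetical one chosen for illustration;
it is not RUI's actual output format, which you should check in that tool's documentation.

```python
def inter_event_intervals(events):
    """Return the elapsed time, in seconds, between consecutive logged events.

    `events` is a list of (timestamp_in_seconds, description) pairs that is
    already sorted by time.
    """
    times = [timestamp for timestamp, _ in events]
    return [round(later - earlier, 3) for earlier, later in zip(times, times[1:])]


def total_task_time(events):
    """Return the time from the first logged event to the last."""
    return round(events[-1][0] - events[0][0], 3)


# Hypothetical log for one trial.
log = [
    (0.000, "stimulus shown"),
    (0.412, "key: a"),
    (0.655, "key: b"),
    (1.300, "mouse click"),
]
print(inter_event_intervals(log))
print(total_task_time(log))
```

The interval list shows where within the task the time was spent, which a single total task time
cannot.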

Using RUI, however, does raise issues regarding privacy in public clusters (e.g., a classroom).
University policies almost universally prohibit installing any tool for experimentation that obtains
a user’s identifying information, such as a login ID or a password (Kim & Ritter, 2007).
Fortunately, Kim and Ritter (2007) describe one possible portable solution to this problem. They
used a simple shell script to automatically run RUI from an external drive (a jump drive). Because
RUI is operated from an external drive, it can be used efficiently on public cluster
machines and then removed when the study is over.

Eye-trackers 

An eye tracker is a device to measure eye positions and movements. It can offer useful data about
cognitive processes when a user interacts with an interface (e.g., a computer screen or a physical
product). This device is sensitive, requiring special care to guarantee the measurement’s
quality.

2.6  Testing  Facility 

A testing facility can be called a psychological testing room, a human factors lab, an ergonomics
lab, a usability lab, or an HCI lab. Rosson and Carroll (2002) state that a usability lab is a
specially constructed observation room. In this observation room, an investigator can simulate a
task environment and record the behavior of participants. Thus, the room should be insulated
from outside influences, particularly noise. However, it is sometimes necessary to observe and
record behaviors of a group of participants interacting with each other. In these cases, it may be
hard to capture this data in a lab setting. Ideally, the testing facility should be flexible enough to
support various types of research involving human participants.

Jakob Nielsen (1994) edited a special issue about usability laboratories. This special issue
describes several representative usability laboratories in computer, telecommunications, and
consumer product companies (e.g., IBM, Symantec, SAP, Philips, and Microsoft). You can
obtain more details about these facilities from this special issue. While this special issue is
somewhat dated, the underlying concerns and some of the technological details remain accurate.
In addition, many of the social processes and uses for video have only become more important.

If  you  are  designing  your  own  study,  you  should  try  to  arrange  access  to  a  room  that  allows 
participants  to  focus  on  the  experimental  task.  Lead  researchers  will  often  have  such  rooms,  or 
can  arrange  access  to  them. 

2.7  Choice  of  Measures:  Performance,  Time,  Actions,  Errors,  Verbal  Protocol 
Analysis,  and  Other  Measures 

2.7.1  Types  of  measures 

There  are  several  types  of  measures.  Questionnaires  are  one  common  and  flexible  type.  By 
answering  the  questions,  participants  self-report  about  the  question,  thus  providing  researchers 
insights  into  their  behavior.  The  quality  and  type  of  these  responses,  however,  depend  upon  the 
quality  and  type  of  the  questions  asked — so  carefully  selected  and  carefully  worded  questions  are 
important. 

One example where questionnaires can be used effectively is studying self-judgment and its
effects. Under certain conditions, our feelings about our knowledge and our actual knowledge
may differ. In this case, our hypothetical researcher asks participants to make a judgment about
what they know after memorizing vocabulary words. Using a Likert scale is a common way to
measure self-judgment. Likert scales typically consist of five points with response options ranging
from “Strongly disagree” to “Strongly agree”. Our hypothetical researcher would then test the
participants and compare the participants’ responses about their knowledge with the test results.

Other  types  of  measures  can  include  physiological  measures.  Cozby  (2004)  introduces  a  few 
popular  physiological  measures  such  as  galvanic  skin  response  (GSR),  electromyogram  (EMG), 
and electroencephalogram (EEG) that help us understand psychological variables. Also, fMRI
(functional magnetic resonance imaging) is a popular method of measuring and examining brain
activities. If you are interested in learning more about these techniques, refer to the section of
Further Readings, specifically Psychophysiological recording (Stern, Ray, & Quigley, 2001).

2.7.2  Levels  of  measurement 

Often  within  a  single  study,  multiple  measures  with  different  characteristics  are  gathered.  For 
instance,  you  can  measure  the  task  completion  time;  or  you  can  measure  the  number  and  the 
times  of  the  keystrokes  and  mouse  actions  performed  by  the  participants  during  the  task.  You  can 
also  measure  what  errors  were  made  during  the  task,  and  so  on.  Let  us  discuss  some  common 
measures  taken  in  an  HCI  or  cognitive  science  experiment. 

It is necessary to decide what you are observing and measuring from the participants who are
performing the experimental task. The decision is important because the choice of measures is
directly related to what aspects of the participants’ behavior are being captured by the task. In
general, there are two types of variables: (a) independent variables, and (b) dependent variables.


Independent  variables  cause,  or  manipulate,  the  changes  in  the  participants’  behavior  that  the 
researchers  seek  to  observe  during  the  study.  Thus,  independent  variables  are  sometimes  called 
manipulated  variables,  treatment  variables,  or  factors  (Keppel  &  Wickens,  2004). 

To cement our understanding of variables, let us presume that we want to measure how humans
forget something they have learned. We will return to this example in more detail in a later
chapter; but for now, we will focus on the study’s independent and dependent variables. Variables
that can manipulate forgetting performance include training types, retention intervals (how long a
participant will retain learned information), or input modalities (what types of skills a participant
is to learn). Thus, we would consider these variables the study’s independent variables. The
researcher deliberately varies them to create the effects; in that sense they are independent.

Dependent  variables  indicate  what  we  will  observe.  Their  values  are  (presumed  to  be)  dependent 
on  the  situation  set  up  by  the  independent  variables.  Dependent  variables  can  either  be  directly 
observed  or  may  be  derived.  The  NASA  TLX,  for  example,  allows  researchers  to  derive  a 
measure  of  workload.  We  directly  measure  each  of  the  six  individual  subscales,  but  the  results 
from  the  tradeoff  questions  are  derived.  In  fact,  performance  is  often  a  derived  measure.  That  is, 
the  dependent  variable  is  affected  by  the  manipulation  of  the  independent  variable.  We  can 
observe  the  time  that  is  required  to  complete  a  task  if  the  investigation  is  to  understand  human 
performance  caused  by  forgetting.  Also,  we  can  observe  errors  produced  by  participants  to 
measure  forgetting.  These  variables  are  considered  to  be  dependent  variables.  There  can  be  one 
or more dependent variables. Analyses with one dependent variable are referred to as univariate
methods, and analyses with two or more dependent variables are referred to as multivariate methods.

To  sum  up,  dependent  variables  are  the  responses  being  observed  during  the  study  while 
independent  variables  are  those  factors  that  researchers  manipulate  to  either  cause  or  change 
those  responses. 
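
One way to make this distinction concrete is to write the design down as data. The sketch below
lists every condition created by crossing the levels of two independent variables from the
forgetting example. The specific levels, the variable names, and the choice of a fully crossed
design are our assumptions for illustration; they are not taken from an actual study.

```python
from itertools import product

# Independent variables: what the researcher manipulates (hypothetical levels).
independent_variables = {
    "training_type": ["massed", "distributed"],
    "retention_interval_days": [1, 7, 30],
}

# Dependent variables: what is observed in every condition.
dependent_variables = ["task_completion_time_s", "error_count"]


def list_conditions(factors):
    """Return every combination of factor levels (a fully crossed design)."""
    names = list(factors)
    return [dict(zip(names, levels)) for levels in product(*factors.values())]


conditions = list_conditions(independent_variables)
print(len(conditions))  # 6
for condition in conditions:
    print(condition)
```

Each participant's record would then hold one condition plus a value for each dependent variable.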

2.7.3  Scales  of  measurement 

Variables can be measured using four basic scales (Ray, 2003): (a) nominal measurements,
(b) ordinal measurements, (c) interval measurements, and (d) ratio measurements. Knowing
these scales of measurement is important because the techniques available to
you for interpreting the results are a function of the scales of measurement used. How
such data are used, and perhaps even how they are stored, depends on what kind of data they are.

Nominal (also referred to as categorical) measurements are used to classify or name variables.
The values represent names or separate categories and have no numeric meaning. For example,
participants can be classified into two groups, a male group and a female group, to measure
performance on using a GPS navigation system. In this case, gender is an
independent variable used to compare performance. Or, if the numbers 1 to 10 are treated as words,
as when counting how often they are said, then there is not necessarily even an order to them; they
could be sorted alphabetically.

Ordinal measurements, in contrast, represent some degree of quantitative difference (or relative
amount). For example, football rankings in the Big Ten conference are an ordinal measurement,
as are ratings on a scale of 1 to 10. Differences between the first and second team, between the 9th
and 10th, between ratings of 4 and 5, and between ratings of 6 and 7 are not necessarily equal, just
ordered.

Interval measurements rely upon scale values based on a single underlying quantitative
dimension. The distance, therefore, between consecutive scale values is meaningful. For
example, the interval between 6 and 12 equals the interval between 12 and 18. That is, the
distance between each pair of values is 6.

Ratio  measurements  determine  value  with  respect  to  an  absolute  zero — there  is  no  length  shorter 
than  0  inches  for  instance.  The  most  common  ratio  measurement  can  be  found  in  a  count 
measure  (i.e.,  the  number  of  hits  or  misses).  For  example,  in  a  shooting  game,  the  number  of  hits 
is  used  to  determine  the  firer’s  accuracy. 
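
The link between the scale of measurement and the available analysis techniques can be
illustrated with summary statistics. The sketch below follows a common textbook convention (mode
for nominal data, median for ordinal data, mean for interval and ratio data); the function name and
the example values are hypothetical, and the convention is a simplification rather than a rule
stated by this guide.

```python
from statistics import mean, median, median_low, mode


def summarize(values, scale):
    """Return summary statistics that are meaningful for the given scale."""
    if scale == "nominal":
        # Categories can only be counted, so report the most frequent one.
        return {"mode": mode(values)}
    if scale == "ordinal":
        # Order is meaningful but distances are not, so use a median that is
        # always one of the observed values rather than an average of two.
        return {"mode": mode(values), "median": median_low(values)}
    if scale in ("interval", "ratio"):
        # Equal distances between values make averaging meaningful.
        return {"median": median(values), "mean": mean(values)}
    raise ValueError(f"unknown scale: {scale}")


print(summarize(["male", "female", "female"], "nominal"))
print(summarize([4, 5, 2, 4, 3], "ordinal"))   # e.g., ratings on a 1 to 5 scale
print(summarize([3, 7, 5, 9], "ratio"))        # e.g., number of hits
```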

Frequently,  sets  of  sequential  data,  or  protocols,  are  gathered  from  human  participants  for  a  given 
task.  Protocols  may  be  multiple  streams  of  data  including  verbal  utterances,  motor  actions, 
environmental  responses,  or  eye  movements  (Newell  &  Simon,  1972).  As  an  example  of  a  verbal 
protocol,  consult  the  testing  methodology  developed  by  Ritter  and  Larkin  (1994)  for  the 
principled  analysis  of  user  behavior. 

Verbal  data  often  provides  insights  into  understanding  human  behavior.  Ericsson  and  Simon 
(1993)  published  a  summary  of  how  and  when  to  use  verbal  reports  as  data  to  observe  humans’ 
internal  cognitive  processes.  The  basic  assumption  of  the  verbal  protocol  theory  is  that 
verbalization  of  a  human’s  memory  contents  (not  their  view  of  their  thought  processes)  can  be 
used  to  derive  the  sequence  of  thoughts  to  complete  a  task.  Thus,  verbalization  can  be  a  valid 
form  of  data  representation  that  offers  us  certain  unique  insights  into  cognition  (see  chunking). 
This type of data requires audio recordings, and often comes with special apparatus for recording
and  special  software  and  tools  for  analyzing  the  results.  It  is  time  consuming,  but  can  be  very 
helpful  for  understanding  how  the  task  is  performed. 

2.8  Error  Data 

Another  type  of  data  to  gather  is  error  data.  Error  data  consists  of  trials  or  examples  where 
subjects  did  not  perform  the  experimental  task  or  some  aspects  of  the  task  correctly.  This  type  of 
data  can  provide  useful  examples  of  where  cognition  breaks  down.  In  addition,  it  helps  describe 
the  limits  of  performance  and  cognition. 

Error  data  is  generally  more  expensive  to  collect  because  in  most  cases  participants  perform  the 
task  correctly.  Thus,  more  trials  have  to  be  run  to  gather  a  hundred  errors  than  it  takes  to  gather  a 
hundred  correct  responses.  If  errors  are  not  interesting  theoretically  for  your  research  study,  some 
pilot  running  of  the  experiments  may  be  required  to  generate  an  experiment  where  errors  do  not 
occur  too  often. 
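
The cost difference is easy to estimate. If each trial has the same, independent chance of producing
an error, the expected number of trials needed to observe a given number of errors is that number
divided by the error rate. The sketch below applies this; the 5% error rate is a hypothetical
figure, and real error rates should come from pilot data.

```python
import math
from fractions import Fraction


def expected_trials(target_count, probability):
    """Expected number of trials needed to observe `target_count` outcomes
    that each occur with the given per-trial probability."""
    if not 0 < probability <= 1:
        raise ValueError("probability must be in (0, 1]")
    # Fraction(str(...)) reads 0.05 as exactly 1/20, avoiding float rounding.
    exact_probability = Fraction(str(probability))
    return math.ceil(target_count / exact_probability)


# Hypothetical task with a 5% error rate.
print(expected_trials(100, 0.05))  # 2000 trials for 100 errors
print(expected_trials(100, 0.95))  # 106 trials for 100 correct responses
```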

2.9  Run  Analysis  with  Pilot  Data 

Before launching your experimental study, we highly recommend that you run some pilot
subjects, gather data from them, and analyze the data. The number to run can be found with
experience, or by talking with your PI. Analysis of pilot data can provide a baseline, or identify
problems with the testing techniques or measures used. Your pilot subjects can be your friends,
family, or subjects taken from your subject pool.

If the results from the pilot data are not what you expected, you can revise the design of
the experiment (e.g., change an independent variable, change the target task, or add another
treatment), keeping in mind that the answer might be that your assumptions are wrong and that using a
small number of subjects only allows you to see large effects. Then, you will need to gather more
pilot data. If the results from the pilot data match your expectations, plan to launch your
experiments to gather data. If not, an interesting new study topic may have emerged.
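
One simple way to look at pilot data from two conditions is to compute a standardized effect size
such as Cohen's d, the difference between the group means divided by their pooled standard
deviation. The sketch below is an illustration only: the scores are invented, the function name is
ours, and with so few pilot subjects the estimate will be very imprecise.

```python
import math
from statistics import mean, variance


def cohens_d(group_a, group_b):
    """Standardized mean difference between two groups, using the pooled
    sample standard deviation."""
    n_a, n_b = len(group_a), len(group_b)
    pooled_variance = (
        (n_a - 1) * variance(group_a) + (n_b - 1) * variance(group_b)
    ) / (n_a + n_b - 2)
    return (mean(group_a) - mean(group_b)) / math.sqrt(pooled_variance)


# Invented pilot scores for two conditions.
treatment_scores = [12, 14, 16]
control_scores = [10, 12, 14]
print(cohens_d(treatment_scores, control_scores))  # 1.0
```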

2.10  Institutional  Review  Board  (IRB)4 

Investigators in psychology or human factors must obtain approval from the appropriate host
institutions or organizations prior to conducting research. To protect the rights of research
participants, universities have established Institutional Review Boards (IRBs). In a university
setting, the IRB is the committee charged with reviewing, approving, and monitoring biomedical and
behavioral research involving humans.

Before the onset of the experiment, investigators must obtain the informed and voluntary consent
of the participants selected for the study. The American Psychological Association’s Ethical
Principles of Psychologists and Code of Conduct5 specifies that participants have the right to
informed consent: they have the right to understand what will happen in the study (e.g., any
known risks of harm, possible benefits, and other details of the experiment). Only after
receiving such a briefing can a participant agree to participate in the experiment. Thus, the
details of the experiment should be written in clear, jargon-free language, without special
technical terms. The participants must be able to easily understand the informed consent form.
In addition, the form should enable prospective participants to determine for themselves whether
they are willing to participate given their own situation and personal tolerance for risk. We
provide an example of an informed consent form in the Appendix.

There are a few exceptions worth noting where IRB approval is not required. If you are running
yourself and only yourself, you do not need IRB approval. If you are running studies only for
class work or programmatic improvement, and not for publication, then IRB approval is not
required. These exceptions are useful when you are piloting studies, or when you are teaching
(or learning). Of course, in most of these cases you can still seek IRB approval. The approval
process offers you the opportunity for feedback on how to make your study safer and more
efficient. Approval also allows later publication if the results are interesting.

IRB approval is required before any aspect of the study that will be published is conducted,
including subject recruitment. Without exception, IRB approval cannot be granted once the
study has been conducted. Consequently, you should seek IRB approval early in the process and
keep your timeline and participant count as “loose” as possible. You do not need to seek new
approval for enrolling fewer participants than requested or for finishing early. You will,
however, need to seek approval for running behind schedule or for enrolling a larger number of
participants.

4  This applies to research in the US. You should enquire locally because some countries do not
see risk in routine cognitive experimental projects.

5  http://www.apa.org/ethics/code2002.html

IRB policies are subject to interpretation, so when in doubt, contact the IRB representative at
your institution.

In general, IRB reviews fall into two categories: expedited review and full review. Most
behavioral science studies that do not involve the use of experimental drugs, radiation, or
medical procedures can be considered for expedited review. Expedited review does not require
approval by the full IRB, and can usually be accomplished within a few weeks (again, this will
vary by institution). For all other cases, you will need to go through a full review; these are
usually scheduled far in advance at some specified interval.

2.11  Further  Readings 

We  list  some  reading  materials  that  will  help  you  plan  and  run  experiments,  and  report  results 
from  the  experiment. 

•  Rosson,  M.  B.,  &  Carroll,  J.  M.  (2002).  Usability  engineering:  Scenario-based  development 
of  human-computer  interaction.  San  Francisco,  CA:  Morgan  Kaufmann  Publishers. 

This book provides a comprehensive background in the area of human-computer interaction.

•  Stern,  R.  M.,  Ray,  W.  J.,  &  Quigley,  K.  S.  (2001).  Psychophysiological  recording  (2nd  ed.). 
New  York,  NY:  Oxford  University  Press. 

Psychophysiological  Recording  is  a  very  useful  book  for  anyone  who  conducts  experiments  with 
human  participants  to  measure  their  psychological  or  physiological  responses.  The  book  provides 
not  only  practical  information  regarding  recording  techniques  but  also  the  scientific  contexts  of 
the  techniques. 

•  Nielsen, J. (Ed.) (1994). Special issue: Usability laboratories. Behaviour & Information
Technology, 13(1-2).

This special issue describes several representative usability laboratories, mostly at computer,
telecommunications, and consumer product companies (e.g., IBM, Symantec, SAP, Philips, and
Microsoft).

•  Ray,  W.  J.,  &  Slobounov,  S.  (2006).  Fundamentals  of  EEG  methodology  in  concussion 
research.  In  S.  M.  Slobounov  &  W.  J.  Sebastianelli  (Eds.),  Foundations  of  sport-related  brain 
injuries  (pp.  221-240).  New  York,  NY:  Springer. 

This book chapter provides background on EEG and its use, including the physiological basis and
frequency analysis of the EEG. In addition, Ray and Slobounov explain EEG research on motor
processes in general and on brain trauma.




3  Potential  Ethical  Problems 


There  are  several  topics  that  you  need  to  keep  in  mind  when  running  subjects.  Chief  among  these 
are  the  ethics  pertaining  to  the  running  of  participants,  and  the  gathering  and  reporting  of  data 
including  published  and  unpublished  documents.  If  you  have  any  questions,  you  should  contact 
the  lead  researcher  (or  principal  investigator),  or  other  resources  at  your  university. 

3.1  Recruitment  of  a  Broad  Selection  of  Subjects 

We would like the results we find to generalize to a wide population, indeed, to the whole
population. Recruiting a representative sample of subjects helps accomplish this. Some observers
have noted that experimenters do not always recruit from the whole population. In some studies,
this is a justifiable approach to ensure reliability (for example, using a single sex in a
hormonal study) or to protect subjects who are at greater risk because of the study (for example,
excluding non-caffeine users from a caffeine study).

Where there are no threats to validity, experimenters should take some care to include a
representative sample. This may mean putting up posters outside your department, and it may
include paying attention to sex balance and even age balance in a study, and then correcting any
imbalance by recruiting more subjects from the underrepresented groups.

As the research assistant, you may be the first to notice such an imbalance and can bring it to
the attention of the investigator.

3.2  Talking  with  Subjects 

When  you  first  welcome  the  subjects  to  your  study  and  the  study  area,  you  might  feel 
uncomfortable.  After  you  have  run  a  few  sessions,  this  discomfort  will  go  away.  In  a  simple 
study,  you  can  be  quite  natural,  as  there  is  nothing  to  ‘give-away’.  In  more  complex  studies,  you 
will  be  busy  setting  up  the  apparatus,  and  this  tends  to  make  things  easier. 

In nearly all cases, abstaining from extraneous comment on the study is an important and useful
practice that makes all parties concerned more comfortable. Many experimental protocols require
not giving the subject feedback during the study. In these cases, you should inform the
participants at the beginning of the session that you are not allowed to provide them feedback on
their performance. Generally, the debriefing can handle most questions, but if you are not sure
how to answer a question, either find and ask the investigator, or take contact details from the
subject and tell them you will get them an answer. And then, do it! This also means that when
you are running subjects for the first couple of times, someone who can answer your questions
should be available.

In social psychology studies or where deception is involved, you will be briefed by the
investigator and will practice beforehand. In this area, practicing and taking advice from the
main investigator are important.




3.3  Coercion  of  Participants 

Coercion is an ethical violation of the rights of human participants. It is necessary to avoid any
procedures in a study that restrict participants’ freedom of consent regarding their participation.
Some participants, including minors, patients, prisoners, and individuals who are cognitively
impaired, are more vulnerable to coercion. For example, enticed by the possibility of payments,
minors might ask to participate in a study. If, however, they do so without parental consent,
this is unethical because they are not old enough to give their consent; agreements by a minor
are not legally binding.

Students are also vulnerable to exploitation. The grade economy presents difficulties, particularly
for courses where a lab component is integrated into the curriculum. In these cases, professors
must not only offer an experiment relevant to the students’ course work but also offer alternatives
to participating in the experiment.

To  address  these  problems,  it  is  necessary  to  identify  any  potential  condition  that  would 
compromise  the  participants’  freedom  of  choice.  For  instance,  in  the  second  example,  recall  that 
it  was  necessary  for  the  professor  to  provide  an  alternative  way  to  obtain  credit.  In  addition,  this 
means  ensuring  that  no  other  form  of  social  coercion  has  influenced  the  participants’  choice  to 
engage  in  the  study.  Teasing,  taunts,  jokes,  inappropriate  comments,  or  implicit  quid  pro  quo 
arrangements  are  all  inappropriate.  These  interactions  can  lead  to  hard  feelings  (that’s  why  they 
are  ethical  problems!),  and  loss  of  good  will  towards  experiments  in  general  and  you  and  your  lab 
in  particular. 

3.4  Sensitive  Data 

When preparing to run the study, you should plan how to deal with sensitive data. There are at
least two issues here: data that you anticipate will be sensitive, and unexpected data that turns
out to be sensitive.

Data that is intrinsically sensitive should be handled carefully. Personal data is the most
common kind. Information about an individual, such as race, creed, gender, gender preference,
religion, friendships, and so on, must be protected. This data should not be lost or mislaid. It
should not be shared with people not working on the project, either formally if you have an IRB
that requires notice, or informally if your IRB does not have this provision (this may occur more
often outside of the US). You should seek advice from your colleagues about what practices are
appropriate in your specific context. In some situations, you are not allowed to take data from
the building, and in most cases, you are encouraged to back it up and keep the backed-up copy in
another safe location.

The  second  type  of  sensitive  data  is  data  that  can  arise  where  the  subject’s  responses  have 
implications  outside  of  the  scope  of  the  study.  This  can  include  subjects  implicating  themselves 
in  illegal  activity,  or  unintentionally  disclosing  an  otherwise  hidden  medical  condition.  For 
example,  if  you  are  administering  caffeine,  and  you  ask  the  subject  what  drugs  they  take  (to  avoid 
known caffeine agonists or antagonists), you may find information about illegal drug use. If you
take a subject’s heart rate or blood pressure measurements, you may discover symptoms of
underlying disease.




Generally, preparation for a study should involve discussions about how to handle sensitive data
and about whether there is a chance that the study may reveal sensitive data about the
participants. You should fully understand your institution’s policies regarding sensitive data,
and how to work with the subjects when sensitive information becomes an issue. If you have
questions, you should ask the principal investigator.

3.5  Plagiarism 

Plagiarism refers to taking others’ work or ideas and using them as one’s own, that is, without
attribution. Particularly in academia, this problem is taken seriously.

An individual might be tempted to steal others’ ideas, research methods, or results from
unpublished or published works. Nowadays, manuscripts that are about to be submitted or have
already been submitted for review can be available online.

Why are people tempted to plagiarize others’ work? Generally, pressure to meet or surpass
institutional standards causes people to plagiarize. To pass a programming class, students might
copy another student’s code. A faculty member facing review for tenure and stressed by the
number of his or her refereed publications, or an RA trying to fill in a methods section, might
be tempted to steal the work of others. Sometimes, the pressure to publish is enough to tempt an
academic to plagiarize others’ ideas or fabricate data.

The integrity and development of scientific knowledge is rooted in the proper attribution of
credit. In the APA’s publication manual (p. 349), you can find the APA’s guidelines for giving
credit. Direct quotes require quotation marks and citations, while paraphrasing or in any way
borrowing from the work of others requires a citation. You may also need to acknowledge people
who give you unpublished ideas for your research designs. In particular, you may have personal
communications (e.g., email, messages from discussion groups on the net, letters, memos, etc.)
that require acknowledgement. In this case, you will need to remember who gave you the idea
(an email thanking them can be a good way to document this), and then cite them in the text with
a date.

3.6  Fraud 

We are sometimes shocked by news about research fraud. For example, if a researcher
fabricates data and publishes a paper with the data, this is fraud. Other scientists trying to
replicate the results are often the ones who find and reveal the initial findings to be fraudulent.
While research fraud is unusual, we must nevertheless be aware that fraud can cause significant
adverse effects not only for the perpetrator of the fraud but also for second or third parties
such as his or her academic institution, funding agency, or corresponding journal editor. It can
also harm more distant people, such as those who base an educational system on a learning theory,
or teaching strategies on incorrect data on memory.

If data is lost, it is lost; do not replace it. If you delete data, do not replace it. If you did
not run a subject, do not run yourself in the subject’s place. All of these practices undermine
your study’s validity and are extremely egregious ethical violations. It is sad when you read in
an article that “data from 3 subjects were lost”, but it is far better to write this than to
commit fraud.




3.7  Summary 

This  chapter  notes  a  few  of  the  most  important  ethical  problems  you  might  face.  You  may 
encounter  others.  If  you  have  questions,  you  should  contact  the  lead  investigator  or  other  senior 
personnel.  In  some  cases,  as  in  many  ethical  situations,  there  may  not  be  a  right  answer,  there 
may  be  several  right  answers,  and  often  there  are  better  answers  and  good,  accepted  practices. 

3.8  Further  Readings 

Here  is  a  list  of  further  readings  for  you  concerning  this  chapter. 

•  You should refer to the APA’s webpage, Ethical Principles of Psychologists and Code of
Conduct. The first version was published in 1992, but it has been superseded by a newer
release issued in June 2003. Here is the link for the current code of conduct:

http://www.apa.org/ethics/code2002.html

•  American  Psychological  Association.  (2001).  Publication  manual  of  the  American 
Psychological  Association.  Washington,  DC:  American  Psychological  Association. 

The  APA  publication  manual  provides  useful  guidance  for  reporting  your  experimental 
findings  in  a  written  paper. 




4  Risks  to  Validity  to  Avoid  While  Running  an  Experiment 

Understanding how subjects will complete the task and working towards uniformity across all
iterations of the task are important. The repeatability of the experiment is a necessary condition
for scientific validity. There are, however, several well-known effects that can affect the
experimental process. Chief among these is the experimenter effect, or the influence of the
experimenter’s presence on the participants. Depending upon the experimental context, the
experimenter effect can lead to either improved or decreased performance. The magnitude and type
of the effect generally depend upon the type and extent of personal interaction between the
participant and the experimenter. Thus, you should strive to provide each participant a
comfortable but neutral testing experience.

Besides the experimenter effect, there are other risks to the experimental process. We hope here
not only to highlight some of them but also to illustrate how to avoid them, either directly or
through proper randomization. Randomization is particularly important because you will most
likely be responsible for implementing treatments, and understanding the other risks will help
you take steps to minimize them. Finally, there are other experimental effects that are outside
of your control; we do not cover these here. Generally, these effects are associated with some
idiosyncrasy (usually an event) in the testing environment. Even though you cannot eliminate all
contingent events, you can note idiosyncrasies and, with the principal investigator, either
correct them or report them for future trials.
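
Because implementing the randomization often falls to the person running the study, a minimal sketch of one common approach, blocked random assignment, may be useful. Everything in it (the function name, the participant IDs, the condition labels, and the use of a seed) is a hypothetical illustration; the procedure actually used in a study should be the one agreed on with the principal investigator.

```python
import random


def assign_conditions(participant_ids, conditions, seed=None):
    """Randomly assign participants to conditions in shuffled blocks.

    Each block contains every condition exactly once in random order, so
    group sizes never differ by more than one. Recording the seed lets the
    assignment be reproduced later.
    """
    rng = random.Random(seed)
    n_blocks = -(-len(participant_ids) // len(conditions))  # ceiling division
    slots = []
    for _ in range(n_blocks):
        block = list(conditions)
        rng.shuffle(block)
        slots.extend(block)
    # zip stops at the shorter sequence, so unused slots are dropped.
    return dict(zip(participant_ids, slots))


# Hypothetical example: ten participants, two conditions.
ids = [f"P{i:02d}" for i in range(1, 11)]
assignment = assign_conditions(ids, ["control", "treatment"], seed=2010)
for pid, condition in assignment.items():
    print(pid, condition)
```

Generating the assignment before the first session, and keeping it with the study records, avoids the temptation to decide conditions on the spot.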

Another common source of variation across trials is the effect of the experimental equipment.
For instance, if you are having subjects interact with a computer or other fixed display, you
should take modest steps to make sure that the distance to the display is the same for each
subject. This does not necessarily mean putting up a tape measure, but in some cases, it does.
Be aware that viewing distance can contribute to blurred vision, irritated eyes, headache, and
movement of the torso and head (e.g., Rempel, Willms, Anshel, Jaschinski, & Sheedy, 2007).
These factors can, in turn, be risks to validity. Furthermore, if subjects are picking up blocks,
cards, or other objects, the objects should either always be in the same positions or always be
randomly placed, because some layouts of puzzles can make the puzzles much easier to solve. The
experimental setup should not be sometimes one and sometimes the other.

There  will  be  other  effects  where  variation  in  the  apparatus  can  lead  to  unintended  differences, 
and  you  should  take  advice  locally  to  learn  how  to  reduce  them. 

4.1  Validity  Defined:  Surface,  Internal,  and  External 

We refer to validity as the degree to which an experiment leads to an intended conclusion from
the data. In general, two types of validity, internal validity and external validity, are of
interest. Internal validity refers to how well experimental treatments explain the outcomes from
the experiment. The experimental treatments are the independent variables that you design.

External  validity,  in  contrast,  refers  to  how  well  the  outcomes  from  the  experiment  explain  the 
phenomena  outside  the  designed  experiment.  This  is  known  as  “generalizability”. 




Campbell and Stanley (1963) discuss 12 factors that endanger internal and external validity.
We need to consider how to reduce or eliminate the effects of these factors to obtain valid
results.

Regarding internal validity, when you run studies you may notice the following factors. Good
principal investigators will appreciate you bringing them to their attention. You should not
panic: some of these are inevitable in some study formats. If they are unanticipated, however,
they may be interesting, or the study may need to be modified to avoid them.

•  History:  Besides  the  experimental  variable,  a  specific  event  could  occur  between  the  first 
and  second  measurement.  Typically,  this  is  some  news  item  such  as  a  space  launch  or  a 
disaster  that  influences  subjects  in  a  global  way  leading  to  better  or  worse  results  than 
would  occur  at  other  times. 

•  Maturation: Participants can grow older, become hungrier, or become more tired with the
passage of time. Thus, if you measure students at the beginning of the school year
and then months later, they may get better scores based on having taken classes.

•  Testing: The effects of taking a test on the scores of a second test. Thus, if you take the
same IQ test a second time, you are likely to score better, particularly if you got
feedback from the first taking.

•  Instrumentation: Measuring instruments must be calibrated regularly. Some
instruments need to be recalibrated with changes in humidity. Failure to recalibrate can
affect an experiment’s results.

•  Statistical regression: We need to avoid selecting groups on the basis of their extreme
scores. If you select subjects based on a high score, some of those high scores will most
likely not reflect the participant’s normal performance, and on a later test those
participants will tend to score closer to the average.

•  Biases:  Differential  selection  of  participants  for  the  comparison  groups  should  be 
avoided.  Subjects  that  come  early  in  the  semester  to  get  paid  or  credit  are  different  from 
the  subjects  who  put  it  off  until  the  last  week  of  the  semester. 

•  Experimental mortality: There could be a differential loss of participants from the
comparison groups. Some conditions could be hard on the subjects, and thus lead them
to come back less often.

•  Selection-maturation interaction: Suppose there are two groups, and participants in one
group develop faster than participants in the other group. In this case, selecting groups
that are maturing at different rates with respect to the outcome can itself cause
posttest differences. This factor is an interaction effect that can exist between a
subject-related variable (e.g., age) and a time-related variable.
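
The statistical regression factor in the list above can be hard to picture, so the following small simulation is offered as an illustration only. It assumes a hypothetical population in which each test score is a stable ability plus independent random noise; the means, standard deviations, sample size, and function name are all made-up values chosen for the sketch.

```python
import random
from statistics import mean


def simulate_regression_to_mean(n=10_000, top_fraction=0.10, seed=1):
    """Select top scorers on a first test and compare them with their retest.

    Each simulated score is a stable ability plus independent noise, so
    nothing about the participants changes between the two tests.
    Returns (first-test mean of the selected group, retest mean of the same
    group, retest mean of everyone).
    """
    rng = random.Random(seed)
    ability = [rng.gauss(100, 10) for _ in range(n)]
    test1 = [a + rng.gauss(0, 10) for a in ability]
    test2 = [a + rng.gauss(0, 10) for a in ability]
    n_selected = int(n * top_fraction)
    ranked = sorted(range(n), key=lambda i: test1[i], reverse=True)
    selected = ranked[:n_selected]
    return (
        mean(test1[i] for i in selected),
        mean(test2[i] for i in selected),
        mean(test2),
    )


first, retest, overall = simulate_regression_to_mean()
print(f"Selected group, first test: {first:.1f}")
print(f"Selected group, retest:     {retest:.1f}")
print(f"Everyone, retest:           {overall:.1f}")
```

In this simulation the selected group scores lower on the retest than on the first test, while still scoring above the overall average, even though no treatment was applied. A drop of this kind could be mistaken for a treatment effect if groups are chosen by extreme scores.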

Regarding external validity, you may also sometimes notice the following factors.

•  The reactive or interaction effect of testing: A pretest could affect (increase or decrease)
the participants’ sensitivity or responsiveness to the experimental variable. Some
pre-tests disclose what the study is designed to study. If the pre-test asks about time spent
studying math and playing math games, you can bet that mathematical reasoning is being
studied in the experiment.

•  The interaction effects of selection biases and the experimental variable: It is necessary to
acknowledge that independent variables can interact with the characteristics of the subjects
that were selected from a population. In this case, the findings from the experiment may not
generalize to the larger population.

•  Reactive effects of experimental arrangements: The experimental situation itself can affect
the outcome, so that the outcome cannot be generalized. That is, the outcome can be a
reaction to the specific experimental situation.

•  Multiple-treatment interference: If multiple treatments are applied to the same
participant, the participant’s performance may not be valid because of the
accumulated effects of those treatments. For example, if you have learned
sample material one way, it is hard to tell if later learning is the result of the new learning
method presented second, or the result of the first method, or the combination of the two.

Why mention these in a book on how to run subjects? Why not just let these be mentioned in
experimental design? We mention them here because if you are a new RA, you may not have had
an experimental design class. And yet, many of these effects will only or mostly be visible to
the person running the study. If there is an event, such as an election, in the country where you
are running subjects, and you will be comparing results with those from a different country where
the PI is located, it is the RA who has the best chance of noticing that something unusual that
is a threat to validity has happened in the study, whereas everyone can notice the global result.

4.2  Risks  to  Validity 

4.2.1  Power:  How  many  participants? 

Performance is noisy. Differences that appear could be due to a theoretical manipulation, or they
could be due to chance. We now discuss the power of a statistical test and how a test’s power
can influence its effectiveness. Calculating the test’s power can help maximize the benefits of a
set of experimental runs by helping you decide how many subjects to run. For instance, although
it is relatively rare, running too many subjects can be wasteful when the effect size is known to
be large. There are other issues that investigators need to consider, such as participant effects
and experimenter effects. We take these issues up in the following sections.

Testing a hypothesis produces one of two outcomes: (a) rejecting the null hypothesis (H0), that
is, accepting the alternative hypothesis (Ha), or (b) failing to reject the null hypothesis. In
making this decision, investigators can make two types of errors, known as Type I and Type II
errors. Table 2.1 describes these errors.




Table 2.1.  Type I and Type II errors in testing the null (H0) and experimental (Ha) hypotheses.

                                          True State
Decision Made         H0 is true                    Ha is true
------------------    --------------------------    --------------------------
Reject H0             Type I error                  Correct decision
                      (report a result,
                      but no effect)

Fail to reject H0     Correct decision              Type II error
                                                    (report no result,
                                                    but there is an effect)

If the null hypothesis (H0) is in fact true, investigators should fail to reject it. A Type I
error occurs when a true null hypothesis is incorrectly rejected. The probability of making a
Type I error is denoted by α. On the other hand, if the alternative hypothesis (Ha) is in fact
true, investigators should reject the null hypothesis and accept the alternative. A Type II error
occurs when investigators fail to reject the null hypothesis even though the alternative
hypothesis is true. The probability of making a Type II error is denoted by β.

The power of a test is defined as the probability of correctly rejecting the null hypothesis (H0)
when it is in fact false; this is denoted by 1 - β. In a practical sense, by calculating the
power, investigators are able to make a statistically supported argument that they will detect a
statistically significant difference when such a difference truly exists.
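
To make the link between power and the number of participants concrete, here is a minimal sketch for one simple design: a two-sided comparison of two equal-sized independent groups, using a normal approximation rather than an exact t-test calculation. The function names, the standardized effect sizes (0.5 and 0.8), and the targets of α = 0.05 and power = 0.80 are assumptions chosen for illustration; a real study should use the analysis and effect-size estimates agreed on with the PI.

```python
from math import ceil, sqrt
from statistics import NormalDist

STD_NORMAL = NormalDist()  # mean 0, standard deviation 1


def power_two_groups(effect_size, n_per_group, alpha=0.05):
    """Approximate power (1 - beta) of a two-sided, two-group comparison.

    `effect_size` is the standardized mean difference (difference in means
    divided by the common standard deviation). Normal approximation.
    """
    z_crit = STD_NORMAL.inv_cdf(1 - alpha / 2)
    shift = effect_size * sqrt(n_per_group / 2)
    # Probability that the test statistic falls in either rejection region.
    return STD_NORMAL.cdf(shift - z_crit) + STD_NORMAL.cdf(-shift - z_crit)


def n_per_group_needed(effect_size, power=0.80, alpha=0.05):
    """Approximate participants per group needed to reach the target power."""
    z_crit = STD_NORMAL.inv_cdf(1 - alpha / 2)
    z_power = STD_NORMAL.inv_cdf(power)
    return ceil(2 * ((z_crit + z_power) / effect_size) ** 2)


for d in (0.5, 0.8):  # hypothetical medium and large effects
    n = n_per_group_needed(d)
    print(f"effect size {d}: about {n} per group, "
          f"approximate power {power_two_groups(d, n):.2f}")
```

Under these assumptions a larger expected effect needs fewer participants per group, which is the sense in which running too many subjects can be wasteful when the effect is known to be large. An exact calculation based on the t distribution gives slightly larger sample sizes.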

4.2.2  Experimenter  effects 

When two or more experimenters are running the same experiment, effects or biases from the
experimenters can exist. Preventing possible experimenter effects is necessary for ensuring the
validity of the experiment. Mitchell and Jolley (2007) identify three causes of error that
investigators should avoid: (a) the loose-protocol effect, (b) the failure-to-follow-protocol
effect, and (c) the researcher-expectancy effect.

First, to avoid the loose-protocol effect when the experiment is run by different experimenters,
it is necessary to write a document that describes the procedures in detail and specifies exactly
when to use each procedure with each subject. The protocol document should allow other
experimenters to run the experiment in exactly the same way, providing a standardized way to run
the trials. Once you have finished a draft of the protocol document, you should test it with
practice participants. Producing the final protocol document will require a few iterations of
writing and then testing the protocols with practice participants.

The second cause of error results from an experimenter’s failure to follow the experiment’s
protocols. There might be several reasons for not following the protocol, including a lack of
motivation to follow the protocol or ignorance of the protocol.




The third cause of error arises from the influence of the experimenter’s expectations upon his or
her interactions with the participants. For instance, I might be biased (consciously or
unconsciously) in how I run the experiment if I know I am testing my hypothesis. After all, I
have a personal incentive to reject the null hypothesis in this case. Therefore, it is preferable
when possible that the experimenters interacting with the subjects be unaware of the hypothesis
being tested and of each subject’s condition. When neither the experimenter nor the subject knows
the condition, the study is called a double-blind study (American Psychological
Association, 2001). An example would be when the RA does not know which amount of caffeine
a subject received, or what condition the subject is in.
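
One way to keep the RA unaware of each subject's condition is to have someone who does not run sessions prepare opaque codes in advance. The sketch below is a minimal illustration under that assumption. The function name, the code format, and the condition labels are hypothetical and are not taken from a particular study.

```python
import random


def make_blind_key(subject_ids, conditions, seed=None):
    """Create opaque treatment codes for a double-blind study.

    Returns two dictionaries:
      key      maps each code to its condition (kept by a third party)
      ra_sheet maps each subject ID to a code (given to the RA)
    The RA never sees the key until data collection is finished.
    """
    rng = random.Random(seed)
    # Spread conditions as evenly as possible, then shuffle the order.
    allocation = [conditions[i % len(conditions)]
                  for i in range(len(subject_ids))]
    rng.shuffle(allocation)
    # Draw distinct four-digit numbers so codes reveal nothing by order.
    numbers = rng.sample(range(1000, 10000), len(subject_ids))
    key, ra_sheet = {}, {}
    for subject_id, condition, number in zip(subject_ids, allocation, numbers):
        code = f"K{number}"
        key[code] = condition
        ra_sheet[subject_id] = code
    return key, ra_sheet


if __name__ == "__main__":
    subjects = ["S01", "S02", "S03", "S04", "S05", "S06"]
    # Hypothetical condition labels for a caffeine study.
    key, ra_sheet = make_blind_key(subjects, ["placebo", "low", "high"])
    print("Sheet for the RA:", ra_sheet)
```

The envelopes or containers handed to the RA would be labeled only with the codes, and the key would be stored where the RA cannot consult it during the study.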

Following consistent written protocols in an unrushed manner is one way to avoid many of these
errors. Please be patient and give the participants enough time to complete each procedure to the
best of their ability.

4.2.3  Participant  effects 

Participant responses and personal characteristics can also invalidate experimental results.
Measurement strategies are one common cause of error. Obtrusive measurements affect
participant performance, thus invalidating the results. Suppose, for example, that a participant is
working on a task on a computer screen. The participant is told that the task completion time is
measured. A running stopwatch is placed beside the participant. In this case, the means of
measuring the task completion time is not appropriate because of the obtrusiveness of the stopwatch.
Thus, it is generally recommended that participants’ performance be measured in
unobtrusive ways.

Because personal characteristics and histories influence performance, it is important to try to
methodically achieve a representative sample when selecting participants. Factors such as
ethnicity, gender, age, experience, native language, or working memory capacity can all
affect performance. Random assignment of subjects to conditions generally helps mitigate this
effect. Random assignment, however, can go wrong (or be done incorrectly), or result in a
suboptimal distribution. RAs are often the earliest, the best, and sometimes the only way to discover these
problems.
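
The sketch below illustrates one way to assign subjects to conditions at random while keeping group sizes equal, and then to tabulate a characteristic per condition so that a lopsided distribution can be spotted early. It is a minimal example under stated assumptions: the function names, subject IDs, and the "native language" attribute are hypothetical.

```python
import random
from collections import Counter


def assign_conditions(subject_ids, conditions, seed=None):
    """Randomly assign subjects to conditions in shuffled blocks.

    Each block contains every condition once, so group sizes never
    differ by more than one.
    """
    rng = random.Random(seed)
    ids = list(subject_ids)
    assignment = {}
    for start in range(0, len(ids), len(conditions)):
        block = list(conditions)
        rng.shuffle(block)
        for subject_id, condition in zip(ids[start:start + len(conditions)], block):
            assignment[subject_id] = condition
    return assignment


def balance_table(assignment, attributes):
    """Count how often each attribute value occurs within each condition."""
    table = {}
    for subject_id, condition in assignment.items():
        table.setdefault(condition, Counter())[attributes[subject_id]] += 1
    return table


if __name__ == "__main__":
    subjects = [f"S{i:02d}" for i in range(1, 13)]
    assignment = assign_conditions(subjects, ["control", "treatment"])
    # Hypothetical attribute recorded at sign-up.
    language = {s: ("native" if i % 3 else "non-native")
                for i, s in enumerate(subjects)}
    for condition, counts in balance_table(assignment, language).items():
        print(condition, dict(counts))
```

If the table shows that one condition has received most of the participants with a given characteristic, raise it with the lead investigator before running more subjects.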

4.2.4  Randomization 

Randomization describes the process of randomly determining both the allocation of the
experimental material and the order in which individual trials are to be performed (Montgomery,
2001). Random sampling, in contrast, is a method for selecting the sample from the population.
Ray (2003) states that one way to achieve external validity is to have the participants in the
experiment constitute a representative sample of the entire population. In practice, true random
sampling is very hard to accomplish. After randomly selecting your participants from the
population, you should randomly assign them to their experimental groups.

Statistical methods require that the observations be independently distributed random
variables. Proper randomization of the experiment helps make this assumption of independence
valid and so allows us to statistically analyze the behavioral data.
Randomization can also help reduce bias in selecting participants.
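
The two ideas above, sampling participants from a pool and randomizing the order of trials, can be sketched as follows. This is an illustration only: the function names are hypothetical, and seeding the trial order with the subject ID is one possible design choice, made here so that a subject's order can be regenerated later for the record.

```python
import random


def sample_participants(pool, n, seed=None):
    """Draw n distinct participants at random from a recruitment pool."""
    return random.Random(seed).sample(list(pool), n)


def trial_order(trials, subject_id, study_seed=0):
    """Return a shuffled copy of the trials for one subject.

    The generator is seeded with the study seed and the subject ID,
    so the same subject always gets the same (reproducible) order.
    """
    rng = random.Random(f"{study_seed}-{subject_id}")
    order = list(trials)
    rng.shuffle(order)
    return order


if __name__ == "__main__":
    pool = [f"P{i:03d}" for i in range(1, 101)]  # hypothetical pool
    chosen = sample_participants(pool, 5)
    for subject in chosen:
        print(subject, trial_order(["A", "B", "C", "D"], subject))
```

Writing the seed and the resulting orders into your notes makes it possible to check afterwards that the randomization was carried out as planned.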




Montgomery (2001) notes that in some situations it is difficult to randomize the experiment
because of a hard-to-change variable (e.g., temperature in a chemical process, subject’s gender).

4.3  Example  Problems 

We note here a problem that arose and the lesson it provides.

One study found that the subjects behaved unexpectedly: they had fewer problems than
was expected. Upon further investigation, it turned out that the student research assistants were
breaking up the lessons into subparts to facilitate learning. This kind of unplanned change to
the instruction should be avoided, because it introduces effects of instruction that reduce the
validity of the study (e.g., VanLehn, 2007).

4.4  Further  Readings 

Here  is  a  list  of  further  reading  materials  concerning  this  chapter. 

Cohen, J. (1992). A power primer. Psychological Bulletin, 112, 155-159.

Cohen, J. (1992). Statistical power analysis. Current Directions in Psychological Science, 1, 98-101.

Cohen  originated  the  current  measure  of  power. 

Howell,  D.  C.  (2007).  Statistical  methods  for  psychology  (6th  ed.).  Belmont,  CA:  Thomson. 

Howell’s book provides a useful summary of how to apply power, written for those learning
statistics. Other introductory statistics books will have similar treatments. They are useful
introductions to this process.




5  Running  a  Research  Study 

This  chapter  provides  practical  information  on  what  to  do  when  you  run  your  experiments.  We 
assume  that  you  have  developed  your  initial  experimental  design  and  are  now  ready  to  run  a  pilot 
study. 


5.1  Script 

Your research study will likely have a script of how to run the session. If it does not, it should,
and it will help you run each subject in a consistent manner. The script will often start with how
to set up the apparatus. Before the subject’s arrival, the experimenter needs to set up the apparatus
and should be ready to welcome the subject. Incorrect or inconsistently applied setup procedures
can cause inconsistencies in how the study is run (e.g., omission
of a step). Consequently, a script that accurately represents the required procedures plays
an important role in conducting a successful experimental study. Appendix D provides an example
script of how to run a study.

5.2  Piloting 

As  mentioned  before,  conducting  a  pilot  study  based  on  the  script  of  the  research  study  is 
important.  Piloting  can  help  you  determine  whether  your  experimental  design  will  successfully 
produce  scientifically  plausible  answers  to  your  inquiries.  If  any  revision  is  necessary,  it  is  far 
better  to  find  it  and  correct  it  before  running  multiple  subjects,  particularly  when  access  to 
subjects  is  limited.  It  is,  therefore,  helpful  to  think  of  designing  experiments  as  an  iterative 
process  characterized  by  a  cycle  of  design,  testing,  and  redesign.  In  addition,  you  are  likely  to 
find  that  this  process  works  in  parallel  with  other  experiments,  and  may  be  informed  by  them 
(e.g.,  lessons  learned  from  ongoing  related  lab  work). 

We also highly recommend that you use pilot studies to test your written protocols (e.g.,
instructions for experimenters). The pilot phase provides experimenters the opportunity to test
the written protocols with practice participants, and is important for ironing out
misunderstandings, discovering problematic features of the testing equipment, and identifying
other conditions that might influence the participants. Revisions are a normal part of the process;
please do not hesitate to revise your protocols. This will save time later.

It  is  also  useful  at  this  stage  to  write  the  method  section  of  your  paper.  Not  only  is  your  memory 
much  fresher  but  also  you  can  show  other  researchers  your  method  section  and  receive 
suggestions  from  them.  These  suggestions  can  save  you  a  lot  of  time,  in  that  these  reviews 
essentially  constitute  another  way  of  piloting  the  study. 

5.3  Dress  Code  for  Experimenters 

You should consider the impression you wish to make and will make when running your
experiment. This consideration should include how your position, the type of experiment, and the
type of participants you are interacting with will influence the experiment.




In most cases, we recommend wearing somewhat formal clothing (sometimes called business casual),
such as a dress shirt with dress slacks, when running experiments. This helps you look
professional and prepared but not intimidating. Somewhat formal dress helps convey the
experiment’s importance while not overwhelming the participant. Encouraging your subjects to
take the experiment seriously should lead to more distinctive but still generalizable effects.

5.4  Welcome 

As the experimenter, you are taking on a role similar to that of a host; thus, it is appropriate to
welcome participants to the study. Where it is appropriate, you might provide them materials to
read if they have to wait, and answer questions they have before the study begins. It is also
appropriate to confirm their names (for class credit), and to confirm for them that they are in
the right place at the right time. If the experimental protocol permits it, you might also
indicate how long the study will take. This helps set the stage for the study itself.

5.5  Missing  Subjects 

In  every  study,  there  are  two  key  parties — the  experimenter  and  the  subject  or  subjects. 

Inevitably,  you  will  encounter  a  situation  where  a  participant  does  not  show  up  despite  having  an 
appointment.  While  participants  should  notify  you  in  advance  if  they  are  going  to  be  absent,  keep 
in  mind  that  missed  appointments  do  happen,  and  plan  around  this  eventuality.  Participants  are 
volunteers  (even  when  you  consider  compensation).  Therefore,  it  is  appropriate  to  be  gracious 
about  their  absence.  Where  possible,  we  recommend  offering  to  reschedule  once.  When  there 
are  repeated  absences,  it  is  often  not  worth  rescheduling. 

In some cases, you as an experimenter may need to cancel an experiment. As an experimenter, it
is not acceptable to simply not show up for an experiment. When you really have to cancel the
experiment for any reason, you should do it in advance. Furthermore, as the experimenter, you
have the responsibility to cancel the experiment by directly contacting the participants.

5.6  Decorum 

Be  culturally  sensitive  and  respectful  to  the  participants.  Consult  with  the  lead  investigator  if  you 
have  general  questions  concerning  lab  etiquette,  or  specific  questions  related  to  the  study. 

5.7  Debriefing 

The  APA’s  ethical  principles  offer  a  general  outline  of  debriefing  procedures.  For  many 
experiments,  the  lead  researcher  may  provide  additional  guidance.  Investigators  should  ensure 
that  participants  acquire  appropriate  information  about  the  experiment  and  the  user  study — such 
as  the  nature,  results,  and  conclusions  of  the  research.  If  participants  are  misinformed  on  any  of 
these  points,  investigators  must  take  time  to  correct  these  misunderstandings.  Also,  if  any 
procedures  are  found  to  harm  a  participant,  the  research  team  must  take  reasonable  steps  to 
alleviate  that  harm. 

The experiment’s procedures may cause participants to feel uncomfortable or be alarmed. After
the experiment is finished, investigators or experimenters should listen to the participants’
concerns and try to address these problems. Mitchell and Jolley (2007) provide reasonable steps
to follow when you need to debrief:


•  Correct  any  misconceptions  that  participants  may  have. 

•  Give  a  summary  of  the  study  without  using  technical  terms  and  jargon. 

•  Provide  participants  an  opportunity  to  ask  any  questions  that  they  might  have. 

•  Express  thankfulness  to  the  participant. 

When you have a study that can be perceived as being deceptive or when the study is a double-blind
study, you should seek advice about how to debrief the participants. If deception is a
procedural  component,  you  will  most  likely  have  to  explain  this  to  the  subjects,  and  ask  that  they 
not  discuss  the  study  until  the  study’s  completion  date.  Requesting  the  participants  to  refrain 
from  discussing  the  study  will  help  keep  potential  subjects  from  being  biased. 

To review, double-blind studies prescribe that neither the subject nor the experimenter knows
which treatment the subject has received, for example, the amount of caffeine any single
participant has ingested in a caffeine study with multiple possible doses. In these cases, you will
have to explain the procedures of the study, as well as provide a general rationale for double-blind
trials. Otherwise, participants may balk at being given a treatment in a sealed envelope, or by a
person who is not the experimenter. Furthermore, events such as the Tuskegee and Holmes
Prison experiments underscore why procedural transparency is so essential.

5.8  Payments  and  Wrap-up 

At the end of the session, you should be sure to compensate the subject as specified by the
disclosure agreement. Compensation can include monetary payment, credit towards a class, or
nothing. If you are paying them monetarily, check with your supervisor, as there are nearly
always detailed instructions for how to process such payments. In any case, you should make
sure that they receive their compensation, that you receive any required documentation such as
receipts, and that you thank each participant for their assistance. Without them, after all, you
cannot run the study.

5.9  Simulator  Studies 

You may find yourself running simulated subjects. User models and simulations are increasingly
used, both as standalone objects and as part of a study to provide a social context. For
example, to model a social situation you might have two intelligent agents act as confederates in a
resource allocation game (Nerb, Spada, & Ernst, 1997). These agents provide a known social
context in that their behavior is known and can be repeated, either exactly or according to a
known set of knowledge.

When  you  run  simulations  as  subjects,  you  should  keep  good  notes.  There  are  often  differences 
between  the  various  versions  of  any  simulation,  and  this  should  be  noted.  Simulations  will  also 
produce  logs,  and  these  logs  should  be  stored  as  securely  and  as  accurately  as  subject  logs.  There 
may  be  more  of  them,  so  annotating  them  is  very  prudent. 
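
One lightweight way to annotate simulation logs is to write a small description file next to each log. The sketch below assumes JSON files and a ".meta.json" suffix. The function name, the field names, and the file naming are hypothetical choices for illustration, not a convention of any particular simulation tool.

```python
import json
from datetime import datetime, timezone


def annotate_run(log_path, model_version, notes, params=None):
    """Write a small JSON file describing one simulation log.

    The file is saved next to the log as <log_path>.meta.json and
    records which version of the simulation produced the log.
    Returns the path of the annotation file.
    """
    record = {
        "log_file": str(log_path),
        "model_version": model_version,
        "recorded_at": datetime.now(timezone.utc).isoformat(),
        "params": params or {},
        "notes": notes,
    }
    sidecar = str(log_path) + ".meta.json"
    with open(sidecar, "w", encoding="utf-8") as f:
        json.dump(record, f, indent=2)
    return sidecar
```

For example, calling annotate_run("run01.log", "v1.2", "two agents as confederates") after a run would leave a note beside run01.log stating which version of the simulation produced it. The file name and version label here are made up.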




5.10  Problems  and  How  to  Deal  with  Them 


When you run an experiment, you can encounter unexpected situations in which a participant is
exposed to some risk of harm. Investigators must be committed to resolving these problems
ethically, recognizing that the well-being of the participants supersedes the value of the study.
We recommend consulting your host organization (e.g., the Office for Research Protections) in the
event that you encounter problems that hinder conducting experiments by affecting either the
experimenters or participants. Where these events are adverse enough, you are required to report
them to the IRB.

5.11  Example  Problems 

We  can  note  a  few  problems  that  we  have  encountered  and  some  lessons  learned. 

In one study, we could not run a few subjects because they could not find the room in which we
were conducting the experiments. A locked hallway entry door and the lack of an escort prevented the
participants from finding the room. Ideally, the pilot study would have identified this
problem; however, only intra-departmental personnel participated in the pilot study.
Consequently, the need for an escort was not identified until the first experimental runs. This
example highlights the importance of knowing your participants and their needs, as well as the
limitations of internal pilot studies.

In another study, a colleague lost valuable data because of a hard drive failure. He had spent
years gathering data on children but had not backed up his data. When his hard drive crashed, he was
given the choice to spend an extra year rerunning subjects (if support was available), or go to
industry. He went to industry where sadly he was very happy!




6  Concluding  a  Research  Session  and  Study 

This  section  explains  practical  information  about  what  you  should  do  when  you  get  done  with 
your  experiment. 

6.1  Data  Care,  Security,  and  Privacy 

All  information  and  data  gathered  from  an  experiment  should  be  considered  confidential.  If 
others  who  are  not  associated  with  the  experiment  have  access  to  either  data  or  personal 
information,  the  participants’  privacy  is  violated.  Thus,  it  is  the  responsibility  of  lead  researchers 
and  experimenters  to  ensure  that  all  security  assurance  procedures  are  promulgated  and  enforced. 

Researchers  must  safeguard  against  the  inappropriate  sharing  of  sensitive  information.  Personal 
information  about  the  participants  must  not  be  shared  with  people  not  associated  with  the  study. 
Thus, the data should not be left unattended. In most studies, experimental data are kept in locked
files  or  on  secure  computers.  The  level  of  security  may  vary  with  the  type  of  data.  Anonymous 
reaction  time  data,  where  the  only  identifying  information  is  a  subject  ID,  is  low  risk.  Personal 
health  records  where  the  subjects  might  be  identified  are  much  more  sensitive,  and  would  require 
more  cautious  storage,  perhaps  being  used  only  on  a  removable  disk. 

6.2  Data  Backup 

To protect against data loss, back up all of your data routinely (after running each subject, and at
least every 10 days when you are doing analyses of the data). If your data are stored in electronic
files, store copies on a secure hard drive or burn them onto a CD. If you are using paper
documents, they can be scanned and stored as computer files as a backup. We suggest that you
back up your data after each subject rather than weekly while conducting a study.
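
A backup is only useful if the copy is intact. The sketch below copies a data folder to a backup location and checks each copy against its original using a checksum. It is a minimal illustration, assuming that the backup folder is not inside the data folder. The function names and any paths you pass in are your own choices, and the secure storage itself still has to follow your lab's data-security rules.

```python
import hashlib
import shutil
from pathlib import Path


def sha256_of(path):
    """Return the SHA-256 checksum of a file, read in 1 MB chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def back_up(data_dir, backup_dir):
    """Copy every file under data_dir to backup_dir and verify each copy.

    Assumes backup_dir is not located inside data_dir.
    Raises IOError if a copy differs from its original.
    Returns the list of backup file paths.
    """
    data_dir, backup_dir = Path(data_dir), Path(backup_dir)
    copied = []
    for src in sorted(data_dir.rglob("*")):
        if not src.is_file():
            continue
        dst = backup_dir / src.relative_to(data_dir)
        dst.parent.mkdir(parents=True, exist_ok=True)
        shutil.copy2(src, dst)  # copy2 also preserves file timestamps
        if sha256_of(src) != sha256_of(dst):
            raise IOError(f"Backup of {src} does not match the original")
        copied.append(dst)
    return copied
```

Running a function like this as the last step of each session (for example, as part of the script in Appendix D) makes the backup a habit rather than something to remember.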

6.3  Chance  for  Insights 

Gathering  data  directly  can  be  tedious,  but  it  can  also  be  very  useful.  Gathering  data  gives  you  a 
chance  to  obtain  insights  about  aspects  of  behavior  that  are  not  usually  recorded,  such  as  the 
user’s  affect,  their  posture,  and  their  emotional  responses  to  the  task. 

Obtaining these kinds of insights and the intuition that follows from these experiences is
important for everyone, but gathering data is particularly important for young scientists. It gives
them a chance to see how previous data has been collected, and how studies work. Reading will
not provide you this background or the insights associated with it; rather, this knowledge comes
only from observing the similarities and differences that arise across multiple subjects in an
experiment.

So, be engaged as you run your study and then perform the analysis. These experiences can be a
source for later ideas, even if you are doing what appears to be a mundane task. In addition,
being vigilant can reduce the number and severity of problems that you and the lead investigator
will encounter. Often, these problems may be due to changes in the instrument, or changes due to
external events. For example, current events may change word frequencies for a study on
reading. Currently, words such as bank, stocks, and mortgages are very common, whereas these
words were less prevalent three or four years ago.




7  Example  Research  Studies 

We  present  example  studies.  In  these  examples,  we  show  how  to  plan  and  prepare  to  run 
experiments  with  human  participants.  This  section  can  help  you  obtain  practical  information  for 
your  own  study. 

7.1  Skill  Retention  Study 

We  draw  our  example  from  a  study  investigating  the  learning  and  forgetting  performance  of 
human  participants  across  a  set  of  procedural  tasks — the  Office  of  Naval  Research  sponsored  this 
work.  We  present  here  specific  procedures  pertaining  to  the  planning  and  running  of  the  study. 

Creation of Study Paradigm: We created a study paradigm to investigate how people forget what
they  have  learned  in  a  laboratory  setting.  For  this  study  paradigm,  we  had  to  create  a  task  that 
was  novel  enough  to  measure  learning  effects  from  participants  (Kim,  2008).  Thus,  we  chose  a 
free  spreadsheet  called  Dismal  (Ritter  &  Wood,  2005)  so  that  we  could  minimize  any  factors  that 
would  decrease  the  data’s  validity.  That  is,  the  task  of  working  with  a  spreadsheet  was  familiar  to 
the  learners,  but  the  Dismal  spreadsheet  has  not  been  used  by  many  people.  We  also  needed  a 
tool  to  unobtrusively  record  the  participant  while  he  or  she  was  performing  the  task;  we  chose 
RUI  for  this  purpose.  We  then  designed  the  experiment —  the  study  uses  a  repeated  measures 
design  to  allow  multiple  measures  of  learning  and  forgetting  from  each  participant. 

Here  is  a  list  of  items  that  you  can  use  to  develop  your  own  study. 

•  A  task 

•  A  task  environment 

•  A  tool  to  record  behavior 

The First Pilot Test: We tested the study with a couple of pilot subjects. In the first pilot study,
we  had  two  male  participants.  We  observed  that  participants  had  difficulty  in  learning  the  task. 

Revising the Task Design: We decided to reduce the cognitive load of the target task so that we
could  measure  learning  effects  in  a  restricted  time.  We  reduced  the  task  difficulty  so  that  most  of 
the  learning  was  of  the  interface  and  not  of  the  math  in  the  task. 

The Second Pilot Test: For the second pilot test, we had a female and a male participant who had
no  prior  knowledge  of  the  task.  The  task  completion  times  for  each  were  consistent  with  previous 
studies.  In  addition,  the  participants  showed  some  forgetting  over  time,  which  was  what  we  were 
studying.  We  concluded  that  the  revised  design  was  satisfactory.  In  some  cases,  it  might  be 
necessary  to  iteratively  revise  the  study  design  further. 

Getting IRB Approval: We prepared documents to receive IRB approval. To do this, we needed
to  provide  detailed  protocols  specifying  how  we  would  run  the  experiments,  as  well  as  detailed 
methods  for  how  we  intended  to  recruit  participants. 

Start Running Experiments: After getting IRB approval, we started running the main experiment.


33 


In  addition  to  these  sequential  steps,  we  would  like  to  share  with  you  some  practical  information 
pertaining  to  the  IRB  process.  When  you  prepare  documents  for  the  IRB,  the  forms  will  require 
you  to  address  the  following  items.  We  also  include  the  responses  for  the  learning  and  retention 
study. 

(a)  The  benefits  of  the  study: 

From your participation, we expect to obtain data representing how much knowledge
and skill can be retained in memory over time. This research can contribute to the design
of a novel training program.

(b)  Any  known  risks  to  the  participant: 

There  is  no  risk  to  your  physical  or  mental  health.  During  the  experiment,  you  can  take  a 
break  at  any  time. 

(c) How the participant’s privacy will be protected:

Your  participation  and  data  are  entirely  confidential.  Personal  identification  numbers 
(e.g.,  PSU  ID)  will  be  destroyed  after  gathering  and  sorting  the  experimental  data. 
Without  personal  identification,  the  gathered  data  will  be  analyzed  and  used  for 
dissertation  and  journal  publications.  The  following  may  review  and  copy  records  related 
to  this  research:  The  Office  of  Human  Research  Protections  in  the  U.S.  Department  of 
Health  and  Human  Services,  the  Social  Science  Institutional  Review  Board,  and  the  PSU 
Office  for  Research  Protections. 

(d)  Voluntary  participation: 

Participation in this study is entirely voluntary. You can refuse to answer
any  questions.  At  any  time,  you  can  stop  and  decline  to  continue  the  experiment.  There  is 
no  penalty  or  loss  of  benefits  if  you  refuse  to  participate  or  stop  at  any  time. 

(e)  Compensation  from  the  experiment: 

Participants will receive monetary compensation of $25, $30, or $35 based on their total
number of sessions, or extra credit (students registered in IST 331). The experiment
consists of 5 to 7 trials ($5 per trial). The compensation will be given as one lump sum
after all trials. For the amounts of $30 and $35, participants will receive a check issued by
Penn State. Others will receive $25 cash. Total research payments within one calendar
year that exceed $600 will require the University to annually report these payments to the
IRS. This may require you to claim the compensation that you receive for participation
in this study as taxable income. For students in IST 331, you will receive 3% added to
the total grade in the course. If you do not wish to take part in the research, you may earn
the extra credit by completing the following:

•  Choose  a  task  to  measure  your  learning  and  forgetting  performance 

•  Gather  your  learning  and  forgetting  data  for  4  hours  of  study  and  test  and  1  retention 

•  Analyze  the  data  to  show  the  total  task  time  per  study  session  and  test  session 


One thing that we want to note here is that investigators should neither implicitly nor explicitly
force participants to participate in the experiment. As noted above, researchers recruiting students
who are enrolled in their classes must be particularly mindful of how they frame student
participation in a study. When investigators use academic credit (extra or otherwise) for
compensation, a balanced alternative must be made available.




8  Afterword 


There are many books available about research methods and related statistical analyses. We,
however, realized that students usually do not have a chance to learn how to run their own
experiments, and that there are no books that teach students practical information about running
experiments with human participants.

Students  charged  with  running  experiments  frequently  lack  specific  domain  knowledge  in  this 
area.  Consequently,  young  researchers  chronically  make  preventable  mistakes.  With  this  book, 
we  hope  to  assist  students  as  they  begin  to  obtain  hands-on  knowledge  about  running 
experiments.  The  topics  and  guidance  contained  in  this  book  arise  from  the  authors’  collective 
experience  in  both  running  experiments  and  mentoring  students. 

Further  methods  of  gathering  data  are  being  developed.  Though  these  changes  will  impact  the 
development  of  future  experimental  procedures,  the  gross  structures  of  a  study  and  the  aspects  we 
have  discussed  here  are  not  likely  to  change. 

As you venture into research, you will find new topics that will interest you. We are not able to
examine all populations or touch upon measurements and tools that require additional training in
this text. Consequently, we are not able to cover in detail the collection of biological specimens,
eye-tracking, or fMRI; however, with further reading and consultation with colleagues, you will
be able to master these skills.

Running  studies  is  often  exciting  work,  and  it  helps  us  understand  how  people  think  and  behave. 
It  offers  a  chance  to  improve  our  understanding  in  this  area.  We  wish  you  good  luck,  bonne 
chance  in  finding  new  scientific  results. 




Appendix  A:  Glossary 


Independent  variable 

A  variable  that  is  manipulated  in  the  study,  either  by  assignment  of 
materials  or  assignment  of  subjects. 

Dependent  variable 

A  measurement  that  is  taken  during  the  study,  such  as  reaction 
time,  or  percent  correct.  It  depends  on  other  things. 

Pilot  study 

An  abbreviated  version  of  the  study  done  to  test  the  procedure  and 
prepare  for  a  larger  study. 

Power 

The  power  in  an  experimental  study  indicates  the  probability  that 
the  test  (or  experiment)  will  reject  a  false  null  hypothesis.  Failure 
to  reject  the  null  hypothesis  when  the  alternative  hypothesis  is  true 
is  referred  to  as  a  Type  II  error.  Thus,  as  the  power  of  a  study 
increases,  the  chances  of  a  Type  II  error  decrease. 

IRB 

Institutional Review Board. It reviews study proposals to ensure
safety and compliance with US federal regulations.

Informed  consent  form 

Null  hypothesis 

The  hypothesis  that  the  treatment  DOES  NOT  lead  to  differences. 

For  example,  the  null  hypothesis  might  be  that  two  interfaces  are 
equally  easy  to  use. 



Appendix  B:  A  Checklist  for  Setting-up  Experiments 

As  an  experimenter  or  a  principal  investigator  for  your  project,  you  need  to  complete  the  items 
below  to  set  up  experiments  to  run. 


□ Prepare the IRB form and submit it to the Office for Research Protections

□ Run pilot tests to make sure your experimental design works

□  Advertise  your  experiment  to  recruit  participants  (e.g.,  flyer,  a  student  newspaper) 

□  Schedule  your  participants  for  an  experiment 

□ Make sure a lab for the experiment is available when you need to run

□  Prepare  how  to  debrief 




Appendix C: An Overview of Steps for Running Experiments

As  an  experimenter  or  an  investigator,  you  need  to  consider  the  items  such  as  those  listed  below 
to  run  your  actual  experiments. 


□ Recruit participants for the experiment

□ Pilot the study

□ Explain detailed information about the experiment (e.g., risks, benefits, and the purpose of
the experiment)

□ Let participants know they can stop participation and performance at any time

□ Protect the participant’s confidentiality

□ Be ready to address any harms or risks from the experiment by talking with the lead
investigator

□ Make sure ethical codes by APA are being considered

□ Archive the data

□ Analyse data

□ Report data




Appendix  D:  Example  Script  to  Run  an  Experiment 

This is a one-page script that every experimenter should read and follow.

Experimenter’s Guide

This is an example script for an experiment. Every experimenter should follow these procedures to run a user
study about skill retention.

(1) Check your dress code

(2) Before your participants come in, set up the experimental apparatus.

a) Start RUI in the Terminal window (see details ..)

b) Start the Emacs text editor.

c) Prepare disposable materials and handouts, such as the informed consent form

(3) Welcome your participants

(4) Put a sign on the door indicating that you are running subjects when the experiment starts

(5) Give them the IRB-approved informed consent form and have them read it

(6) If they consent to it, start the experiment

(7) Briefly explain what they are going to do

(8) Give them the study booklet.

a) Participants may take at most 30 minutes to study the booklet.

(9) While participants are reading the booklet, you can answer their questions about the task.

(10) Turn on the monitor that is located in the experimental room, so that you can monitor the participant
from outside the room.

(11) When the experiment is finished, explain the payments or extra credit. Also, if
there are any additional sessions scheduled for later measures, remind them.

(12) Take down the sign on the door when the experiment is done

(13) Copy the data to an external hard drive

(14) Shut down the apparatus

(15) Prepare supplies for the next subject
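
Step (13) can be partly automated so that a copy is never silently incomplete. The following is a minimal sketch, not part of the original procedure: the function names (`sha256_of`, `back_up_log`) and the file and directory names in the demo are hypothetical, and the checksum comparison is simply one reasonable way to confirm that the copy matches the original.

```python
import hashlib
import shutil
import tempfile
from pathlib import Path


def sha256_of(path):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def back_up_log(log_file, backup_dir):
    """Copy one session log to backup_dir and verify the copy.

    Refuses to overwrite an existing backup so that an earlier
    participant's data cannot be replaced by mistake.
    """
    log_file = Path(log_file)
    backup_dir = Path(backup_dir)
    backup_dir.mkdir(parents=True, exist_ok=True)
    target = backup_dir / log_file.name
    if target.exists():
        raise FileExistsError(f"{target} already exists; refusing to overwrite")
    shutil.copy2(log_file, target)  # copy2 also preserves timestamps
    if sha256_of(log_file) != sha256_of(target):
        raise OSError(f"checksum mismatch after copying {log_file}")
    return target


if __name__ == "__main__":
    # Demonstration with temporary files (hypothetical names).
    with tempfile.TemporaryDirectory() as tmp:
        log = Path(tmp) / "ruioutput.txt"
        log.write_text("example keystroke log\n")
        print("backed up to", back_up_log(log, Path(tmp) / "external_drive"))
```

In practice the backup directory would be the mount point of the external drive, and each participant's log would need a distinct file name before copying.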


Using  RUI 

RUI (Recording User Input) will be used to log the participant’s keystrokes and mouse actions. RUI
requires Mac OS X 10.3 (Panther) or later. It has been tested up to Mac OS X 10.4.3 (Tiger). For
RUI to record user inputs, “Enable access for assistive devices” must be enabled in the Universal
Access preference pane.

(1) Launch Terminal

(2) In Terminal, type the following command:

./rui -s "Subject Name" -r ~/Desktop/ruioutput.txt

(3) You will get this message:

rui: standing by - press ctrl+r to start recording...

(4) Press “CTRL+r”

(5) To stop recording, press “CTRL+s”

Note:

If you see the message “-bash: ./rui: Permission denied” in the Terminal window, you need to type
“chmod a+x rui” while you are in the RUI directory.
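
The steps above could be wrapped in a small helper that builds the command for each participant and warns about the permission problem before the session starts. This is a hedged sketch: the `-s` and `-r` flags and the output path are copied from the command shown above, their exact meaning is assumed rather than documented here, and the helper names (`build_rui_command`, `is_executable`) are hypothetical.

```python
import os
import shlex


def build_rui_command(subject_name, output_file, rui_path="./rui"):
    """Build the argument list for the RUI command shown above.

    The -s and -r flags are taken from the example command; passing the
    arguments as a list avoids quoting problems with names that contain spaces.
    """
    return [rui_path, "-s", subject_name, "-r", os.path.expanduser(output_file)]


def is_executable(path):
    """True if path is an existing file that the current user may execute."""
    return os.path.isfile(path) and os.access(path, os.X_OK)


if __name__ == "__main__":
    command = build_rui_command("Subject Name", "~/Desktop/ruioutput.txt")
    print("Command to run:", shlex.join(command))
    if not is_executable(command[0]):
        # Mirrors the note above about "Permission denied".
        print("rui is missing or not executable here; "
              "in the RUI directory, run: chmod a+x rui")
```

The resulting list could be passed to `subprocess.run` from the RUI directory; that step is left out because RUI waits for CTRL+r in the terminal.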




Measuring  Learning  &  Forgetting 

Emacs  is  started  by  the  experimenter  for  every  session.  The  participants  will  start  and  stop  RUI  to  record 
their  performance.  The  experimenter  needs  to  ensure  that  the  participants  cannot  do  mental  rehearsal  during 
the  retention  period. 
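
Because participants return after a fixed retention period, it helps to compute the return date when the last learning session ends. The sketch below assumes the retention intervals listed in the example consent form in Appendix F (6, 9, 12, 18, 30, or 60 days); the function name `return_date` and the date used in the demo are hypothetical.

```python
from datetime import date, timedelta

# Retention intervals (in days) taken from the example consent form.
RETENTION_INTERVALS_DAYS = (6, 9, 12, 18, 30, 60)


def return_date(last_learning_day, interval_days):
    """Return the date on which a participant should come back."""
    if interval_days not in RETENTION_INTERVALS_DAYS:
        raise ValueError(f"unexpected retention interval: {interval_days}")
    return last_learning_day + timedelta(days=interval_days)


if __name__ == "__main__":
    last_day = date(2010, 1, 20)  # hypothetical final learning session
    for days in RETENTION_INTERVALS_DAYS:
        print(f"{days:2d}-day interval: return on "
              f"{return_date(last_day, days).isoformat()}")
```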




Appendix  E:  Safety  of  Experiments 

Some common safety concerns:
For cognitive psychology studies, there are none.

For more unusual procedures, see your IRB.
For studies involving stress, see your IRB.

For studies that take samples from humans, see your IRB.




Appendix  F:  Example  Consent  Form 

Here  is  an  example  of  an  informed  consent  form  that  you  can  refer  to  when  you  need  to  generate 
one  for  your  experiment. 


Informed  Consent  Form  for  Biomedical  Research 

The  Pennsylvania  State  University 


Title:  Investigating  a  Forgetting  Phenomenon  of  Knowledge  and  Skills 


ORP  USE  ONLY:  IRB#21640  Doc.  #1 

The  Pennsylvania  State  University 
Office  for  Research  Protections 
Approval  Date:  09/09/2008  -  J.  Mathieu 
Expiration  Date:  09/04/2009  -  J.  Mathieu 
Biomedical  Institutional  Review  Board 


Principal  Investigator:  Dr.  Frank  E.  Ritter 

316G IST Bldg, University Park, PA 16802
(814)  865-4453  frank.ritter@psu.edu 


Other  Investigators: 

Dr.  Richard  J.  Koubek 
310  Leonhard  Building 
University  Park,  PA  16802 
(814)  865-7601  rkoubek@psu.edu 


Dr.  Jong  Wook  Kim 
316E IST Building
University  Park,  PA  16802 
(814)  865-6166;  jongkim@psu.edu 


1.  Purpose  &  Description:  The  purpose  of  the  study  is  to  investigate  how  much  knowledge  and  skills 
are  forgotten  and  retained  in  human  memory  after  a  series  of  learning  sessions.  Human  performance 
caused  by  forgetting  will  be  quantitatively  measured.  If  you  decide  to  take  part  in  this  experiment, 
please  follow  the  experimenter’s  instruction. 


The experiment is held at 319 (Applied Cognitive Science Lab.) or 205 (a computer lab) IST building.
During  the  experiment,  the  timing  of  keystrokes  and  mouse  movements  will  be  recorded. 


A  group  of  participants  (80  participants)  selected  by  chance  will  wear  an  eye-tracker  to  measure  eye 
movements  during  the  task,  if  you  consent  to  wear  the  device.  You  can  always  refuse  to  use  it.  The 
eye-tracker  is  a  device  to  measure  eye  positions  and  eye  movements.  The  eye-tracker  is  attached  to  a 
hat,  so  you  just  can  wear  the  hat  for  the  experiment.  The  device  is  examined  for  its  safety.  You  may  be 
asked  to  talk  aloud  while  doing  the  task. 

2.  Procedures  to  be  followed: 

a.  You  will  be  asked  to  study  an  instruction  booklet  to  learn  a  spreadsheet  task  (e.g.,  data 
normalization).  Each  study  session  will  be  30  minutes  maximum.  For  four  days  in  a  row,  you 
will  learn  how  to  do  the  spreadsheet  task. 

b.  Then,  you  will  be  asked  to  perform  the  given  spreadsheet  tasks  on  a  computer  (duration: 
approximately  15  minutes). 

c.  With  a  retention  interval  of  6-,  9-,  12-,  18-,  30-,  or  60-day,  after  completing  the  second  step, 
you  will  be  asked  to  return  to  do  the  same  spreadsheet  task  (duration:  approximately  15 
min/trial) 




3.  Voluntary  Participation:  The  participation  of  this  study  is  purely  based  on  volunteerism.  You  can 
refuse  to  answer  any  questions.  At  any  time,  you  can  stop  and  decline  the  experiment.  There  is  no 
penalty  or  loss  of  benefits  if  you  refuse  to  participate  or  stop  at  any  time. 

4.  Right  to  Ask  Questions:  You  can  ask  questions  about  this  research.  Please  contact  Jong  Kim  at 
jongkim@psu.edu  or  814-865-6166  with  questions,  complaints,  concerns,  or  if  you  feel  you  have  been 
harmed  by  this  research.  In  addition,  if  you  have  questions  about  your  rights  as  a  research  participant, 
contact  the  Pennsylvania  State  University’s  Office  for  Research  Protections  at  (814)  865-1775. 

5.  Discomforts  &  Risks:  There  is  no  risk  to  your  physical  or  mental  health.  You  may  experience  eye 
fatigue  because  you  are  interacting  with  a  computer  monitor.  During  the  experiment,  you  can  take  a 
break  at  any  time. 

6.  Benefits:  From  your  participation,  it  is  expected  to  obtain  data  representing  how  much  knowledge  and 
skills  can  be  retained  in  the  memory  over  time.  This  research  can  make  a  contribution  to  design  a  novel 
training  program. 

7.  Compensation:  Participants  will  receive  monetary  compensation  of  $25,  $30,  or  $35  in  terms  of  your 
total trials, or extra credits (students registered to IST 331). The experiment consists of 5 to 7 trials ($5
per  trial).  The  compensation  will  be  given  as  one  lump  sum  after  all  trials.  For  the  amount  of  $30  and 
$35,  participants  will  receive  a  check  issued  by  Penn  State.  Others  will  receive  a  cash  of  $25.  Total 
research  payments  within  one  calendar  year  that  exceed  $600  will  require  the  University  to  annually 
report  these  payments  to  the  IRS.  This  may  require  you  to  claim  the  compensation  that  you  receive  for 
participation  in  this  study  as  taxable  income. 

8.  Confidentiality:  Your  participation  and  data  are  entirely  confidential.  Personal  identification  numbers 
(e.g.,  PSU  ID)  will  be  destroyed  after  gathering  and  sorting  the  experimental  data.  Without  personal 
identification,  the  gathered  data  will  be  analyzed  and  used  for  dissertation  and  journal  publications. 
The  following  may  review  and  copy  records  related  to  this  research:  The  Office  of  Human  Research 
Protections  in  the  U.S.  Department  of  Health  and  Human  Services,  the  Social  Science  Institutional 
Review  Board  and  the  PSU  Office  for  Research  Protections. 

You must be 18 years of age or older to take part in this research study. If you agree to take part in this
research study and the information outlined above, please sign your name and indicate the date below.

You will be given a copy of this signed and dated consent for your records.


Participant  Signature 


Date 


Person  Obtaining  Consent  (Principal  Investigator)  Date 




Appendix  G:  Example  Debrief  Form 


HRI  Debriefing  Form 


Thank  you  for  participating  in  our  human-robot  interface  testing  study. 

From your participation we will learn how people use interfaces in general and Human-Robot
interfaces in particular. These interfaces are similar to those used to work
in hazardous areas, including those used in rescue work at the World Trade Center. By
participating, you have been able to see and use a new technology. The results can lead to
improved interfaces for robots that replace humans in hazardous conditions.

You  may  also  find  the  Robot  project  overview  page  useful  and  interesting. 

If  you  have  any  questions,  please  feel  free  to  ask  the  experimenter.  You  can  also  direct  questions 
to  Dr.  Frank  Ritter,  (frank.ritter@psu.edu,  865-4453). 




Appendix  H:  Example  IRB  Application 

Your Institutional Review Board will have its own review forms. These forms are based on each
IRB’s institutional history, and the types of studies and typical problems (and atypical problems)
that they have had to consider over time. Thus, the form we include here can only be seen as an
example form. We include it to provide you with an example of the types of questions and, more
importantly, the types of answers characteristic of the IRB process. You are responsible for the
answers, but it may be useful to see examples of how long they are and how detailed they
need to be.

Following  is  a  form  used  in  one  of  our  recent  studies. 




Penn State


APPLICATION FOR THE USE OF HUMAN
PARTICIPANTS
EXPEDITED & FULL REVIEWS


Office for Research Protections
201 Kern Building
University Park, PA 16802
814-865-1775
Fax:  B14-B63-M8& 
ORProtections@psu.edu


OFFICE  USE  ONLY 
IRB  NO. _ 


Form Instructions:

o To complete the form, press TAB or SHIFT TAB between boxes and enter an 'X' or text. For assistance, contact the Office for
Research Protections.

o This application will ask general questions about your study. Depending on your response, additional appendices may need to be
completed in order to provide more detailed information. For example, if you indicate that your study involves prisoners, Appendix
4 will also need to be completed and submitted.

o Submit recruitment materials, informed consent forms, and all other materials as attachments to the application. Do NOT include
within the application.

o Handwritten applications will NOT be accepted.


Project Title: Gathering Data From Computer Interface Users to Test Cognitive Models


Principal Investigator: Frank Ritter, PhD, C. Psychol.

PSU User ID (e.g., abc123): fer2

University Status (Faculty, Staff, Student, etc.): Faculty

Telephone Number: +1 (814) 865-4453

Email Address: frank.ritter@psu.edu

Dept: Mere

College: College of IST

Campus: University Park

Mailing Address: 316G IST Building

Faculty Advisor, if PI is a student:

PSU User ID (e.g., abc123):

Email Address:

Telephone Number:

Dept:

College:

Mailing Address:

Campus:

Is there anyone you wish to include on correspondence related to this study (e.g., a study coordinator, etc.)?

Name:

PSU User ID (e.g., abc123):

University Status (Faculty, Staff, Student, etc.):

Telephone Number:

Email Address:

Dept:

College:

Campus:

Mailing Address:

Role in this study: Choose one of the following





A. Funding:

1. Is this research study internally or externally funded?
□ Yes  Answer Questions 2 - 4
☒ No  Skip to Question 6
□ Pending  Answer Questions 2 - 5

2. Provide the name and mailing address of internal and external sources of funding. Provide a copy of your grant proposal with the
application. If a copy of the grant proposal is not included, explain.


3. Is the sponsor providing the drug, device, etc. free of charge?  □ Yes  □ No  ☒ N/A

4. Has the sponsor agreed to pay for direct costs of treating injuries?  □ Yes  □ No

5. If funding is not awarded, will the research still be conducted?  □ Yes  □ No  □ N/A

B. Conflict of Interest:

6. Do any of the investigator(s), key personnel, and/or their spouses or dependent children have a conflict of interest (COI), as defined
by PSU Policy RA20, Individual Conflict of Interest, associated with this research?

□ Yes  Complete & Submit Appendix 1, Section A
☒ No

7. Does PSU have an ownership or royalty interest in any intellectual property related to this study?

□ Yes  Complete & Submit Appendix 1, Section B
☒ No

8. Are there any other significant conflicts that could possibly affect or be perceived to affect this study?

□ Yes  Complete & Submit Appendix 1, Section C
☒ No

C. Class Projects:

9. Is this a class project?

□ Yes  Provide the following information:

Instructor's Name:

Course Title and Number:
Semester course is being offered:

□ No

D. Review Level:

10. What level of review do you expect this research to need?

☒ Expedited Review  Answer Question 11
□ Full Review  Skip to Question 12

11. Expedited Research Categories: Read the following categories and choose one or more that apply to your research. Your research
must fit in at least one category and be no more than minimal risk in order to be considered for an expedited review.

□ Category 1: Clinical studies of drugs and medical devices only when condition (a) OR (b) is met.

(a) Research on drugs for which an investigational new drug application (21 CFR 312) is not required. (Research on marketed drugs
that significantly increases the risks or decreases the acceptability of the risks associated with the use of the product is not eligible
for expedited review.)

(b) Research on medical devices for which (i) an investigational device exemption application (21 CFR 812) is not required; or (ii)
the medical device is cleared/approved for marketing and the medical device is being used in accordance with its cleared/approved
labeling.






□ Category 2: Collection of blood samples by finger stick, heel stick, ear stick or venipuncture as follows:

(a) From healthy, non-pregnant adults who weigh at least 110 pounds. For these participants, the amounts drawn may not exceed
550 ml in an 8 week period and collection may not occur more frequently than 2 times per week; OR

(b) From other adults and children, considering the age, weight, and health of the participants, the collection procedure, the amount
of blood to be collected, and the frequency with which it will be collected. For these participants, the amount drawn may not exceed
the lesser of 50 ml or 3 ml per kg in an 8 week period and collection may not occur more frequently than 2 times per week.

□ Category 3: Prospective collection of biological specimens for research purposes by non-invasive means. Examples include:

o Hair and nail clippings in a non-disfiguring manner;

o Deciduous teeth at time of exfoliation or if routine patient care indicates a need for extraction;

o Permanent teeth if routine patient care indicates a need for extraction;

o Excreta and external secretions (including sweat);

o Uncannulated saliva collected either in an unstimulated fashion or stimulated by chewing gumbase or wax or by applying a
dilute citric solution to the tongue;

o Placenta removal at delivery;

o Amniotic fluid obtained at the time of rupture of the membrane prior to or during labor;

o Supra- and subgingival dental plaque and calculus, provided the collection procedure is not more invasive than routine
prophylactic scaling of the teeth and the process is accomplished in accordance with accepted prophylactic techniques;

o Mucosal and skin cells collected by buccal scraping or swab, skin swab, or mouth washings;

o Sputum collected after saline mist nebulization

□ Category 4: Collection of data through non-invasive procedures (not involving general anesthesia or sedation) routinely employed in
clinical practice, excluding procedures involving x-rays or microwaves. Where medical devices are employed, they must be
cleared/approved for marketing. Studies intended to evaluate the safety and effectiveness of the medical device are not generally eligible
for expedited review, including studies of cleared medical devices for new indications. Examples include:

o Physical sensors that are applied either to the surface of the body or at a distance and do not involve input of significant
amounts of energy into the participant or an invasion of the participant's privacy;

o Weighing or testing sensory acuity;

o Magnetic resonance imaging;

o Electrocardiography, electroencephalography, thermography, detection of naturally occurring radioactivity, electroretinography,
ultrasound, diagnostic infrared imaging, doppler blood flow, and echocardiography;

o Moderate exercise, muscular strength testing, body composition assessment, and flexibility testing where appropriate given the
age, weight, and health of the individual.

□ Category 5: Research involving materials (data, documents, records, or specimens) that have been collected or will be collected
solely for non-research purposes (such as medical treatment or diagnosis).

☒ Category 6: Collection of data from voice, video, digital, image recordings made for research purposes.

□ Category 7: Research on individual or group characteristics or behavior (including but not limited to research on perception,
cognition, motivation, identity, language, communication, cultural beliefs or practices, and social behavior) or research employing survey,
interview, oral history, focus group, program evaluation, human factors evaluation, or quality assurance methodologies.

E. Research Personnel:

NOTE:

* The Principal Investigator is responsible for ensuring that all individuals conducting procedures described in this application are trained
adequately prior to involving human participants.

* All personnel listed on this application who (1) are responsible for the design/conduct of the study, (2) will have access to the human
participants (i.e., will consent participants, conduct the study), or (3) will have access to identifying AND confidential information must
successfully complete the IRB's Training on the Protection of Human Participants or provide verification of training from their home
institution. PSU's training may be located at http://www.research.psu.edu/orp/education/modules/irb/index.asp. Approval will NOT be
granted until all individuals have successfully completed the training. Verification of training does NOT need to be sent in if the
individual completed the Penn State's training.

* As personnel change, you must submit a Modification Request Form - Expedited & Full Review to add or remove personnel.




12. Provide the name of the other individual(s) assisting with this study who (1) will be responsible for the design/conduct of the study,
(2) have access to the human participants (i.e., will consent participants, conduct the study), or (3) have access to identifying AND
confidential information. If the individual does not have a PSU Access User ID, please provide some other form of contact information.
If additional space is needed, attach a separate sheet containing the same information.

Name: Maik Friedrich
Email Address: mFriedrich@ist.psu.edu
PSU User ID (e.g., abc123): muf10
Mailing Address: College of IST, Penn State, University Park, PA 16802
Role in this Study: Research Assistant

13. Identify (1) the procedures/techniques each person (including advisors) listed in Question 12 and on the first page of the
application will perform and (2) describe their level of research experience.

(1) Each person will recruit subjects for a cognitive psychology study, to run the subjects in a cognitive psychology
study, and to store the data.

(2) Ritter has taught these research methods and run studies in this area since 1990. Friedrich has passed the PSU IRB
test and has been instructed by Ritter, and will continue to be trained.

14. Explain how the persons assisting with this research are kept adequately informed about the study and their research-related duties
and functions.

The persons will be instructed in how to perform cognitive psychology studies, and their performance will be
monitored by Ritter. This will include discussions following running the first five subjects.

F. Purpose & Procedures:

15. Provide a detailed description of the research that includes (1) the background, (2) aims/objectives [hypothesis], and (3) a
description of how the research will be conducted [methodology - what participants will be asked to do].

This research is designed to look at how people problem solve, and how they learn. The task used will be a previously
used simple interface problem solving task. (2) The hypotheses are related to how fast the learning will occur, and are an
exploration to discover what strategies users use. (3) The participants will be asked to read instructional materials, and
then to solve faults in a simple device. Their behavior will be recorded while they do this.

16. How long will participants be involved in this research study? Include the number of sessions and the duration of each session.

1 session, 30 to 50 minutes per session.

17. Where will this research study take place? Choose all that apply.

☒ University Park. Specify the building and room number. If not yet known, indicate such. 319b IST Building

□ GCRC at University Park

□ Other PSU Campus Location  Specify the campus, building and room number. If not yet known,
indicate such.

□ Hershey Medical Center  Specify the building and room number. If not yet known, indicate such.

□ GCRC at the Hershey Medical Center
□ Mt. Nittany Medical Center
□ Other Site(s)  Explain:

NOTE: For other sites such as schools, doctor offices, businesses, etc., the IRB requires that research conducted at these
sites be approved by an individual in a decision making position at the site. Documented approval (i.e., a letter of agreement)
is required.


52 


18. Is this a multi-center study outside of PSU?

ED Yes Answer Question 19
X No Skip to Question 22

19. Is any Penn State investigator on this application the lead investigator (project director) of this multi-center study?
ED Yes Answer Questions 20 - 21
ED No Skip to Question 22


20. Provide the name and location of all other centers. Copies of IRB approval letters from each site will be required with the
supporting documentation for this application.


21. Describe the plan for the management and communication of multi-site information that may be relevant to the protection of
participants (e.g., unanticipated problems, adverse events, interim analyses, modifications).


22.  How  will  the  data  be  analyzed? 

The  data  will  be  summarized  across  problem  series;  the  data  will  be  compared  to  existing  predictions  of  a 
computational  model;  the  pattern  of  the  fit  to  the  predictions  will  be  used  to  find  and  define  new  strategies. 
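
The analysis plan in the answer above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the trial format, the function names, the example times, and the model predictions are all hypothetical, and the fit measures (a coefficient of determination computed against the predictions, and RMSE) are one reasonable descriptive choice rather than the measures used in the actual study.

```python
from statistics import mean


def summarize_by_position(trials):
    """Average task time for each problem position across participants.

    trials: iterable of (participant_code, problem_position, seconds).
    Returns {problem_position: mean_seconds}.
    """
    by_position = {}
    for _code, position, seconds in trials:
        by_position.setdefault(position, []).append(seconds)
    return {pos: mean(vals) for pos, vals in sorted(by_position.items())}


def fit_to_predictions(observed, predicted):
    """Return (r_squared, rmse) over positions present in both dicts.

    r_squared is 1 - SS_res / SS_tot measured against the model's
    predictions, not a squared correlation.
    """
    positions = sorted(set(observed) & set(predicted))
    obs = [observed[p] for p in positions]
    pred = [predicted[p] for p in positions]
    obs_mean = mean(obs)
    ss_res = sum((o - p) ** 2 for o, p in zip(obs, pred))
    ss_tot = sum((o - obs_mean) ** 2 for o in obs)
    r_squared = 1 - ss_res / ss_tot if ss_tot else float("nan")
    rmse = (ss_res / len(obs)) ** 0.5
    return r_squared, rmse


# Hypothetical data: two participants, three problems each.
trials = [
    ("S01", 1, 30.0), ("S02", 1, 34.0),
    ("S01", 2, 22.0), ("S02", 2, 26.0),
    ("S01", 3, 18.0), ("S02", 3, 20.0),
]
model_predictions = {1: 31.0, 2: 25.0, 3: 19.0}  # hypothetical model output

observed = summarize_by_position(trials)
r_squared, rmse = fit_to_predictions(observed, model_predictions)
print(observed, round(r_squared, 3), round(rmse, 3))
```

Positions where the observed means depart systematically from the predictions are the natural places to look for strategies the model does not yet capture.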

23.  List  criteria  for  inclusion  of  participants. 

Willingness  to  participate  in  study.  Over  10  years  old. 

24. List criteria for exclusion of participants.

None. 


G. Participants

25. Maximum number of participants/samples/charts to be enrolled at this institution (Enter one number - not a range): 20

26. Was a statistical power analysis conducted to determine the adequate sample? ED Yes ED No NA This study does not
use inferential statistics.


27. Does this research exclude any particular:
Gender Identity |_| Yes El No If Yes, please explain.
Racial/ethnic groups ED Yes X No If Yes, please explain.
Sexual Orientation ED Yes E) No If Yes, please explain.


28. Age range - Choose all that apply.

CD Less than 1 year
Q 1-6 years
0 7-12 years
[~] 13-17 years
X 18-25 years
E] 26-40 years
X 40-65 years
0 65+ years


29. Choose all categories of participants who will be involved in this research study.

X Healthy volunteers

X Penn State students

□ Subject Pool Students - Indicate the subject pool: O CAS 100A ED Psychology - UP ED Psychology - Behrend

Will all participants involved in this study be from the subject pool? ED Yes ED No

CD Children - Individuals under the age of 18 (Complete & Submit Appendix 2)

CD International Research - participants live outside of the U.S. (Complete & Submit Appendix 3)

□ Prisoners (Complete & Submit Appendix)

□ Pregnant Women

X Women of reproductive potential at the time of this research - Choose one of the following:

X The research poses no added risk associated with pregnancy and/or lactation

ED Precautions against pregnancy and/or lactation, and pregnancy tests are addressed in the research proposal and consent
form




II Patients (Complete & Submit Appendix 5)

0 Individuals with a decisional impairment who are targeted for this study (e.g., research on Alzheimer's enrolling only individuals
with Alzheimer's) (Complete & Submit Appendix 6)

Q Individuals with a decisional impairment who are NOT targeted for this study (e.g., decisionally compromised person eligible for
a study on a new treatment for breast cancer) (Complete & Submit Appendix 6)

D Institutionalized individuals (e.g., patients in state hospitals or nursing homes) (Complete & Submit Appendix 7)

□ Fetus, embryo, fetal material, in vitro fertilization
II None of the above categories will be used in this research


30. Will participants be currently enrolled in a course/class of any personnel listed on this application?
□ Yes Describe the measures taken to avoid coercion or undue influence:

IS No

31. Will participants be employees of any personnel listed on this application?
d Yes Describe the measures taken to avoid coercion or undue influence:

IS No


32. Could some or all participants be vulnerable to coercion or undue influence due to special circumstances? Do not include children,
decisionally impaired persons, and prisoners in your answer.

| Yes Describe the measures taken to protect these individuals:

IS No


H. Recruitment

33. Indicate the types of recruitment that will be done for this research & attach copies of the materials. Choose all that apply:
_ Newspaper/magazine ads
_ Radio/TV ads

_ Letters/Emails to potential participants

Explain how potential participants' contact information was obtained:

_ Letters/Emails to healthcare professionals for recruitment purposes

Which healthcare groups will receive these letters?

3 Flyers/posters - Where will the items be displayed/distributed? In the College of IST

_ Brochures - Where will the items be displayed/distributed?

Il Web sites - List the sites where the recruitment materials will be posted:

_ Email via Listserv - Has permission been obtained from the listserv administrator? 0 Yes 0 No
II Script - Verbal (i.e., telephone, face-to-face, classroom)

Subject Pool - Indicate which subject pool will be used:

0 CAS 100A 0 Psychology - UP 0 Psychology - Behrend

Note: If you are not a member of the subject pool's department, a permission letter will be needed.

0 Other Explain:


34. Who will approach and/or respond to potential participants?
Ritter and Friedrich


35. Before potential participants sign a consent form, are there any screening questions that will be asked to determine whether an
individual is appropriate for the study?

_ Yes Answer Question 36
0 No Skip to Question 37

36. During screening questions, will identifiable information about these individuals be recorded?

Yes (Complete & Submit Appendix)

0 No




NOTE;  Please  attach,  as  appropriate,  a  procedure  and  script  for  the  screening  questions.  Also,  attach  a  copy  of  the  screening 
question  data  collection  sheet. 

37. Will investigators access medical charts and/or hospital/clinic databases for recruitment purposes?

Hi Yes Answer Question 38
0 No Skip to Question 39

38. Has a waiver of authorization to access protected health information been requested?

HI Yes

11 No Explain why a waiver of authorization has NOT been requested:

39. Will physicians/clinicians provide identifiable patient information (e.g., name, telephone number, address) to investigators for
recruitment purposes?

| Yes Provide a copy of the written authorization release form for review.

0 No

I.  Consent: 

40. When and where will participants be approached to obtain informed consent/assent (include the timing of obtaining consent in the
response)? If participants could be non-English speaking, illiterate or have other special circumstances, describe. Attach a copy of the
informed consent/assent form(s).

Participants  will  be  approached  to  obtain  informed  consent  when  they  come  to  participate  in  the  study. 

41. Who will be responsible for obtaining informed consent/assent from participants?

The experimenter running the session: Ritter or the RA.

42. Do the people listed in Question 41 above speak the same language as the participants?

0 Yes

HI No Explain how consent will be obtained.

43. What type of consent will be obtained? Choose all that apply.

X Signed consent - participant will sign consent form

□ Implied consent - participant will not sign consent form (e.g., mail survey, email, on-line survey)

Complete & Submit Appendix 9, Section A

□ Verbal consent - participant gives consent verbally (e.g., in-person interview, telephone interview)

Complete & Submit Appendix 9, Section A

Hi Passive/Opt-Out consent - participant only required to act if they do not want to participate
Complete & Submit Appendix 9, Section B

_ Complete waiver of informed consent

Complete & Submit Appendix 9, Section B
0 Other Describe:

44. If multiple groups of participants are being utilized (i.e., teachers, parents, children, people over 18), who will and will not sign the
assent/consent form? Specify for each group of participants.


45. Participants are to receive a copy of the informed consent form with the approval box/statement on it. Describe how participants
will receive a copy of the informed consent form to keep for their records.

They  will  sign  two  copies,  and  one  copy  will  be  handed  to  them. 


J.  Payment  for  Participation: 

46.  Indicate  the  type  and  amount  of  payment  for  participation  that  will  be  offered.  Choose  all  that  apply. 
0 Money Amount: $7 (Skip to Question 49)

0 Gift Certificate Amount: (Skip to Question 49)

0 Extra/Class Credit (e.g., 5 points, 1% of final grade) Amount: (Skip to Question 47)

0 Drawing Explain: (Skip to Question 49)




Other (e.g., merchandise) Explain: (Skip to Question 48)

_ Compensation will NOT be offered (Skip to Question 49)


47. An alternative, equal in time and effort, must be offered in place of participating in the research. Describe the alternative available
for earning the extra/class credit. The description should include the length of time it will take to complete the alternative as well as how
undue influence will be prevented.


48. Will compensation be pro-rated? NOTE: Pro-rating is required for FDA-regulated studies.
Yes Explain how payment will be pro-rated:

0 No


K. Data Collection Measures/Instruments

49. Choose any of the following data collection measures/instruments that will be used in this study. Attach a copy of all
instruments/measures, interview and focus group topics/questions to the application.

Biological Specimens - blood, urine & other human-derived samples
_ Biomedical Devices - EEG, EKG, MRI
_ Diaries/Journals completed by the participants

□ Focus Groups

_ Individual Interviews
= Knowledge/Cognitive Tests
J Observations

□ Physical Testing Measures - Height, Weight, Body Mass Index, Blood Pressure
! Questionnaires/Surveys - Mail, Internet, Telephone, Email, Paper/Pencil

S! Other Explain: Mouse moves, keystrokes, and video recordings.


50.  Will  participants  be  assigned  to  groups? 
_  Yes  Answer  Questions  51  -  52 
0  No  Skip  to  Question  54 


51. Will a control group(s) be used?

_  Yes  Choose  one  of  the  following: 

Placebo  control 
□  Standard  therapy  control 
O  Other  control  method  Explain: 
□  No 


52. Is the research a blinded (masked) study?

□ Yes Answer Question 53

□ No Skip to Question 54

53. Is emergency unblinding permitted?

D Yes

dl No Explain why emergency unblinding is NOT permitted:


L. Recordings - Audio, Video, Photographs

54. Will any type of recordings (audio or video) or photographs be made during this study?
0 Yes Complete & Submit Appendix 10

0 No


M. Computer/Internet

55. Will any participant interaction in this study be conducted on the Internet or via email (e.g., on-line surveys, observations of chat
rooms or blogs, on-line interviews)?

j Yes Complete & Submit Appendix 11, Section A
El No



56. Will a commercial server (i.e., SurveyMonkey, Psych Data, Zoomerang) be used to collect data or for data storage?

LI Yes Complete & Submit Appendix 11, Section B
0 No

N. Discomforts and Risks

57. List all of the potential discomforts and risks (physical, psychological, legal, social or financial) and describe the likelihood or
seriousness of the discomforts/risks. If there are no discomforts/risks, state such.

No additional discomforts or risks for participants beyond daily life.


58. Describe how risks will be minimized and/or how participants will be protected against potential risks throughout the study.

There are no additional risks to participants.

59. Does this research involve greater than minimal risk to the participants?

Yes Answer Questions 60 - 61 Study must be reviewed by the Full IRB at a convened meeting.

0 No Skip to Question 62

60. Will medical or psychological care be available for participants who may require it as a result of the study?

Yes Identify the source of medical or psychological care available - include address & telephone number:

0 No Explain why medical or psychological care will NOT be available: It is difficult to choose what possible care
could be required.

61. Does the research protocol have a plan for routine analysis or monitoring of the data and safety of this research study?

O Yes Complete & Submit Appendix 17

X No For studies involving greater than minimal risk, a plan will need to be developed for review and approval at the
convened IRB meeting.

O. Benefits

62. What are the potential benefits to the individual participants? If none, state such. PLEASE NOTE: Payment for participation cannot
be considered a benefit.

There are two benefits put forward. Participants will get to see how a study is performed, and the data that is gathered
may lead to improved instructional material for them and for society.

63. What are the potential benefits to society? If none, state such.

The results from the study can improve our understanding of how learning and problem solving occur. This has
implications for training and education.

64. Explain how the benefits outweigh the risks.

There  are  minimal  risks,  and  the  benefits  are  that  this  study  can  lead  to  improved  training  and  learning  paradigms. 

P. Reporting

65. Is it possible investigators will discover a participant's previously unknown condition (e.g., disease, suicidal thought, wrong
paternity) as a result of study procedures?

I . Yes Explain how and when such a discovery will be handled:

0 No

66. Is it possible investigators will discover a participant is engaging in illegal activities (e.g., drug use, domestic violence, child
abuse/neglect, underage drinking) as a result of study procedures?

_ Yes Explain how and when such a discovery will be handled:

fxl No

Q.  Deception 

67. Does this study involve giving false or misleading information to participants or withholding information from them such that their
"informed" consent is in question?



_! Yes Complete & Submit Appendix 12

0 No


R.  Confidentiality  and  Privacy 

68. Describe the provisions made to maintain confidentiality of the data. Choose all that apply.

□ Password protected computer files □ Locked offices

□ Locked file cabinets □ Other Explain:

X Identification code (i.e., code numbers, pseudonyms) - data will NOT be associated w/personal identifiers

69. Describe the provisions made to protect participants' privacy interests.

No  identifying  information  will  be  kept  with  the  keystroke  logs  or  with  the  videos. 

70. Who will have access to the data?

Researchers  approved  to  work  on  this  study. 

71. Will identifiers be disclosed to a sponsor or collaborators at another institution?

_ Yes List the identifiers that will be disclosed and explain why this is necessary:

0  No 

72.  Will  a  list  containing  a  code  (i.e..  code  numbers,  pseudonyms)  and  participants'  identity  be  used  in  this  study? 

X  Yes  Answer  Questions  73  -  75 

□  No  Skip  to  Question  76 

73. Where will the list linking the code to participants' identity be stored and how will the list be secured?

List will be kept on a printed sheet in the PI's office.

74. Who will have access to the list linking the code to participants' identity?

The RA when running the study and the PI after the study has been run.

75. Will the list linking the code to participants' identity be destroyed?

0 Yes When will the list be destroyed? 5 years after running the study or after the last publication, whichever
comes last.

□  No 
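
The coding scheme described in the confidentiality answers above (data keyed only by an identification code, with the code-to-identity list kept separately) might be sketched as follows. This is a hypothetical illustration, not the procedure used in the study: the function names, the four-digit code format, and the example participants and events are all assumptions.

```python
import csv
import io
import secrets


def assign_codes(names, prefix="P"):
    """Give each participant a unique random four-digit code, e.g. 'P4821'."""
    codes = {}
    used = set()
    for name in names:
        code = f"{prefix}{secrets.randbelow(9000) + 1000}"
        while code in used:  # redraw on the rare collision
            code = f"{prefix}{secrets.randbelow(9000) + 1000}"
        used.add(code)
        codes[name] = code
    return codes


def write_linking_list(codes, stream):
    """Write the code-to-identity list; this is the sheet to print and lock away."""
    writer = csv.writer(stream)
    writer.writerow(["code", "name"])
    for name, code in codes.items():
        writer.writerow([code, name])


def write_data_rows(rows, stream):
    """Write behavioral data keyed only by code, never by name."""
    writer = csv.writer(stream)
    writer.writerow(["code", "event", "time_ms"])
    writer.writerows(rows)


# Hypothetical participants and events, for illustration only.
codes = assign_codes(["Participant One", "Participant Two"])
linking_list = io.StringIO()  # stands in for the printed sheet
data_log = io.StringIO()      # stands in for the keystroke/mouse log
write_linking_list(codes, linking_list)
write_data_rows(
    [
        (codes["Participant One"], "keypress:a", 1520),
        (codes["Participant Two"], "mouse_move:200,310", 980),
    ],
    data_log,
)
```

Keeping the two outputs in separate places is the point of the design: the data log can be shared among approved researchers, while the linking list stays with the PI until it is destroyed.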

76. What will happen to the research records when the research has been completed? Choose only one.

_ Stored indefinitely with identifiers removed

0 Stored indefinitely with identifiers attached

List the identifiers that will be attached to the data: subject ID

Explain why the data must be stored indefinitely with identifiers: once an ID has been assigned, analyses use that ID

_ Stored for length of time required by federal regulations/funding source & then destroyed (minimum of 3 years)

_ Destroyed after a number of years (minimum of 3 years) Specify the number of years:

_ Destroyed when notified by sponsor

3 Other Explain:

77. Could the information being collected for this study have adverse consequences for participants or be damaging to their financial
standing, employability, insurability or reputation?

_ Yes Indicate the type of information being collected:

□ Substance abuse or other illegal risk behaviors
I i Determination of HIV status for the research
□ Genetic information about inheritable diseases
□ Other Explain:

0 No

78. Will a "Certificate of Confidentiality" be obtained from the federal government?

_ Yes Indicate who will obtain the Certificate of Confidentiality



Sponsor 

Principal  Investigator 
Other  Explain: 


El  No 


S.  Health  Insurance  Portability  &  Accountability  Act  (HIPAA)  -  Use  of  protected  health  information 
79. Will participants' protected health information (PHI) be obtained for this study?

O Yes Complete & Submit Appendix 15

H No

T.  Drugs,  Medical  Devices,  and  Other  Substances 

80. Does this research study involve drugs or biologics?

□ Yes Complete & Submit Appendix 14, Section A
El No


81. Does this research study involve a device?

I ] Yes Go to Question 82
No Skip to Question 83

82. Does the device meet the FDA's definition of a medical device?
El Yes Complete & Submit Appendix 14, Section C

O No Go to Question 83


FDA's Definition of a Medical Device: If a product is labeled, promoted or used in a manner that meets the following definition in section
201(h) of the Federal Food, Drug, and Cosmetic (FD&C) Act it will be regulated by the Food and Drug Administration (FDA) as a medical
device and is subject to pre-marketing and post-marketing regulatory controls. A device is:

* "an instrument, apparatus, implement, machine, contrivance, implant, in vitro reagent, or other similar or related article, including a
component part or accessory which is:

o Recognized in the official National Formulary, or the United States Pharmacopoeia, or any supplement to them,

o Intended for use in the diagnosis of disease or other conditions, or in the cure, mitigation, treatment, or prevention of disease, in
man or other animals, or

o Intended to affect the structure or any function of the body of man or other animals, and which does not achieve any of its primary
intended purposes through chemical action within or on the body of man or other animals and which is not dependent upon being
metabolized for the achievement of any of its primary intended purposes."


U.  Biological  Specimens 

83. Will biological specimens (including blood, urine and other human-derived samples) be used in this study?

_ Yes Complete & Submit Appendix 15

El No

NOTE: If the response to Question 83 is YES, an application must be submitted to the Institutional Biosafety Committee (IBC).
The IBC Applications may be located at http://www.research.psu.edu/orp/areas/biohazardous/applications/index.asp.

V.  Other  Biomedical  Procedures  -  Diagnostic  Radiation  Procedures.  Physical  Activity,  Diet  Modifications 

84. Will participants be asked to undergo diagnostic radiation procedures while enrolled in this study?

Yes Complete & Submit Appendix 16

El No

85. Will participants be required to engage in or perform any form of physical activity?

X Yes Describe the nature and extent of the physical activity: They will have to use a pen and paper, and they will use
a keyboard and mouse.


No 


86. Will any type of electrical equipment other than audio headphones be attached to the participants (e.g., EMG, EKG)?



' I Yes Submit a letter describing the most recent safety check of the equipment with the supporting documents for this application.


87. Will there be any diet modifications or restrictions?

Ell  Yes  Describe: 

El  No 

W.  Assurances 

As the principal investigator on this research study, I assure that...

1 .  this  application,  if  funded  by  an  extramural  source,  accurately  reflects  all  procedures  involving  human  participants  described  in  the 
grant  proposal  to  the  funding  agency  previously  noted  or  an  explanation  is  given  for  any  differences. 

2. I will obtain approval from the Institutional Review Board (IRB) before initiating any changes to the approved study, including
changes in procedures, personnel, documents, instruments, etc., except where necessary to eliminate apparent immediate
hazards to participants. In the latter instance, the IRB must be notified by the next workday.

3. I am familiar with and will comply with all pertinent institutional, local, state, and Federal regulations and policies. I will adhere to
the policies and procedures described in Penn State's Federalwide Assurance with the Office for Human Research Protections as
well as Federal regulations for the protection of human participants involved in research (45 CFR 46; 21 CFR parts 50 & 56). Copies
of these documents are available in the ORP upon request or on their website - http://www.research.psu.edu/orp/.

4. the information provided in this application reasonably summarizes the nature and extent of the proposed use of human
participants.

5. I will notify the IRB within 5 business days regarding any significant adverse events that impact human participants.

6. all individuals listed on this form are competent and have been properly trained. I also assure that all individuals will complete the
required training for the protection of human participants available on-line prior to contact with human participants.

7. any individual associated with or responsible for the design, the conduct, or the reporting of this research will comply with Penn
State's Conflict of Interest Policy, RA-05.


Signature  of  Principal  Investigator,  REQUIRED 


Date 


I  hereby  confirm  that  I  have  read  this  application  and  my  signature  denotes  the  completeness  and  accuracy  of  the  information  provided. 


PRINT Name of Faculty Advisor, REQUIRED IF PI IS A STUDENT

SIGNATURE of Faculty Advisor, REQUIRED IF PI IS A STUDENT   Date


I hereby confirm that I have read this application and my signature denotes departmental/unit approval of this project. To the best of my
knowledge, the information in the attached application relating to members of my department is correct.

The investigator(s) who are members of my department are qualified to perform the roles proposed for them in this application. Any
novice  researchers  from  my  department  will  be  supervised  by  qualified  investigators. 




PRINT Name of PI's Department/Unit Head, REQUIRED


SIGNATURE of PI's Department/Unit Head, REQUIRED   Date




9  References 

American Psychological Association. (2001). The publication manual of the American
Psychological Association (5th ed.). Washington, DC: American Psychological Association.

Campbell,  D.  T.,  &  Stanley,  J.  C.  (1963).  Experimental  and  quasi-experimental  designs  for 
research.  Boston,  MA:  Houghton  Mifflin  Company. 

Cozby,  P.  C.  (2004).  Methods  in  behavioral  research  (8th  ed.).  New  York,  NY:  McGraw-Hill. 

Ebbinghaus,  H.  (1885/1913).  Memory:  A  contribution  to  experimental  psychology. 

Ericsson, K. A., & Simon, H. A. (1993). Protocol analysis: Verbal reports as data. Cambridge,
MA: Bradford Books/MIT Press.

Fitts,  P.  M.  (1954).  The  information  capacity  of  the  human  motor  system  in  controlling  amplitude  of 
movement.  Journal  of  Experimental  Psychology,  47(6),  381-391. 

Keppel,  G.,  &  Wickens,  T.  D.  (2004).  Design  and  analysis:  A  researcher's  handbook.  Upper 
Saddle River, NJ: Prentice Hall/Pearson Education.

Kim,  J.  W.  (2008).  Procedural  skills:  From  learning  to  forgetting.  Unpublished  doctoral 
dissertation,  The  Pennsylvania  State  University,  University  Park,  PA. 

Kim,  J.  W.,  Koubek,  R.  J.,  &  Ritter,  F.  E.  (2007).  Investigation  of  procedural  skills  degradation 
from  different  modalities.  In  R.  L.  Lewis,  T.  A.  Polk  &  J.  E.  Laird  (Eds.),  Proceedings  of  the 
8th  International  Conference  on  Cognitive  Modeling  (pp.  255-260).  Oxford,  UK:  Taylor  & 
Francis/Psychology  Press. 

Kim,  J.  W.,  &  Ritter,  F.  E.  (2007).  Automatically  recording  keystrokes  in  public  clusters  with 
RUI:  Issues  and  sample  answers.  In  D.  S.  McNamara  &  J.  G.  Trafton  (Eds.),  Proceedings  of 
the  29th  Annual  Cognitive  Science  Society  (p.  1787).  Austin,  TX:  Cognitive  Science  Society. 

Kukreja,  U.,  Stevenson,  W.  E.,  &  Ritter,  F.  E.  (2006).  RUI:  Recording  user  input  from  interfaces 
under Windows and Mac OS X. Behavior Research Methods, 38(4), 656-659.

MacWhinney,  B.,  St.  James,  J.,  Schunn,  C.,  Li,  P.,  &  Schneider,  W.  (2001).  STEP — A  system  for 
teaching experimental psychology using E-Prime. Behavior Research Methods,
Instruments, & Computers, 33(2), 287-296.

Mitchell,  M.  L.,  &  Jolley,  J.  M.  (2007).  Research  design  explained.  Belmont,  CA:  Thomson. 

Montgomery,  D.  C.  (2001).  Design  and  analysis  of  experiments  (5th  ed.).  New  York,  NY:  John 
Wiley  &  Sons. 

Nerb, J., Spada, H., & Ernst, A. M. (1997). A cognitive model of agents in a commons dilemma.
In Proceedings of the 19th Annual Conference of the Cognitive Science Society (pp. 560-565).
Mahwah, NJ: Erlbaum.

Newell, A., & Simon, H. A. (1972). Human problem solving. Englewood Cliffs, NJ: Prentice-Hall.

Nielsen, J. (1994). Usability laboratories. Behaviour & Information Technology, 13(1-2), 3-8.

Ray, W. J. (2003). Methods: Toward a science of behavior and experience (7th ed.). Belmont,
CA: Wadsworth/Thomson Learning.

Reder, L. M., & Ritter, F. E. (1992). What determines initial feeling of knowing? Familiarity with
question terms, not with the answer. Journal of Experimental Psychology: Learning,
Memory, and Cognition, 18(3), 435-451.




Rempel,  D.,  Willms,  K.,  Anshel,  J.,  Jaschinski,  W.,  &  Sheedy,  J.  (2007).  The  effects  of  visual 
display  distance  on  eye  accommodation,  head  posture,  and  vision  and  neck  symptoms. 
Human  Factors,  49(5),  830-838. 

Ritter,  F.  E.,  &  Larkin,  J.  H.  (1994).  Developing  process  models  as  summaries  of  HCI  action 
sequences.  Human-Computer  Interaction,  9,  345-383. 

Ritter,  F.  E.,  &  Wood,  A.  B.  (2005).  Dismal:  A  spreadsheet  for  sequential  data  analysis  and  HCI 
experimentation.  Behavior  Research  Methods,  37(1),  71-81. 

Roediger, H. (2004). What should they be called? APS Observer, 17(4), 46-48.

Rosson,  M.  B.,  &  Carroll,  J.  M.  (2002).  Usability  engineering:  Scenario-based  development  of 
human-computer  interaction.  San  Francisco,  CA:  Morgan  Kaufmann  Publishers. 

Salvucci,  D.  D.  (in  press).  Rapid  prototyping  and  evaluation  of  in-vehicle  interfaces.  ACM 
Transactions  on  Computer-Human  Interaction. 

Schoelles,  M.  J.,  &  Gray,  W.  D.  (2001).  Argus:  A  suite  of  tools  for  research  in  complex 
cognition.  Behavior  Research  Methods,  Instruments,  &  Computers,  33(2),  130-140. 

Stern,  R.  M.,  Ray,  W.  J.,  &  Quigley,  K.  S.  (2001).  Psychophysiological  recording  (2nd  ed.).  New 
York,  NY:  Oxford  University  Press. 

VanLehn,  K.  (2007).  Getting  out  of  order:  Avoiding  lesson  effects  through  instruction.  In  F.  E. 
Ritter,  J.  Nerb,  T.  O'Shea  &  E.  Lehtinen  (Eds.),  In  order  to  learn:  How  the  sequences  of 
topics  affect  learning  (pp.  169-179).  New  York,  NY:  Oxford  University  Press. 




